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PRELIMINARY AMENDMENT 
Please amend the above-identified patent application as follows: 

In the Specification 

After page 17 : Please insert as new page 18 the attached Abstract of the Disclosure. 
In the claims 

The following amendments are made with respect to the claims in the international 
application PCT/GB99/03721 attached as Annexes to the International Preliminary Examination 
Report (PER). Therefore, please replace existing page 17 of the international application with 
the amended claim sheet (replacement page 17) of the annex attached to the IPER, and make the 
following amendments to the pending claims so that they read as follows: 
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Claim 1 (amended) : 

An isolated peptide encoded by an operon, wherein said operon comprises a gene 
selected from the group consisting of tatA, tatB, tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, 
iroC, iroE, mtd2, and msl to msl6, obtainable from E. coli Kl, or a homologue thereof in a 
Gram-negative bacterium, wherein said homologue has at least 30% homology at the amino acid 
or nucleotide level, or a functional fragment thereof. 

Claim 2 (amended) : 

The isolated peptide, according to claim 1, comprising an amino acid sequence selected 
from the group consisting of SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 19, 21, 23, 24, 25, 
26, 28, 29,31,32 and 35-48. 

Claim 3 (amended) : 

An isolated polynucleotide which comprises a gene selected from the group consisting of 
tatA, tatB, tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, mtd2, and msl to msl 6, 
obtainable from E. coli Kl, or a homologue thereof in a Gram-negative bacterium, wherein said 
homologue has at least 30% homology at the amino acid or nucleotide level, or a functional 
fragment thereof. 

Claim 4 (amended) : 

A host transformed to express a peptide encoded by an operon, wherein said operon 
comprises a gene selected from the group consisting of tatA, tatB, tatC, tatE, mdoG, creC, recG, 
yggN, eckl, iroD, iroC, iroE, mtd2, and msl to msl 6, obtainable from E. coli Kl, or a 
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homologue thereof in a Gram-negative bacterium, wherein said homologue has at least 30% 
homology at the amino acid or nucleotide level, or a functional fragment thereof. 



Claim 5 (amended) : 

A vaccine comprising a peptide, or the means for its expression, wherein said peptide is 
encoded by an operon, wherein said operon comprises a gene selected from the group consisting 
of tat A, tatB, tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, mtd2, and msl to 
msl6, obtainable from E. coli Kl, or a homologue thereof in a Gram-negative bacterium, 
wherein said homologue has at least 30% homology at the amino acid or nucleotide level, or a 
functional fragment thereof. 

Claim 6 (amended) : 

A vaccine comprising a microorganism having a virulence gene mutation, wherein the 
gene is selected from the group consisting of tatA, tatB, tatC, tatE, mdoG, creC, recG, yggN, 
eckl, iroD, iroC, iroE, mtd2, and msl to msl 6, obtainable from E. coli Kl, or a homologue 
thereof in a Gram-negative bacterium, wherein said homologue has at least 30% homology at the 
amino acid or nucleotide level, or a functional fragment thereof. 

Claim 7 (amended) : 

The vaccine, according to claim 6, wherein said virulence gene mutation comprises a 
virulence gene deletion in two genes, wherein one gene encodes tatA and the other encodes tatE. 
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Claim 8 (amended) : 

The vaccine, according to claim 6, wherein the gene lies within a pathogenicity island. 



Claim 9 (amended) : 

A method for screening potential drugs, or for the detection of virulence, wherein said 
method utilizes a peptide encoded by an operon, wherein said operon comprises a gene selected 
from the group consisting of tat A, tatB, tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, 
iroE, mtd2, and msl to msl6, obtainable from E. coli Kl, or a homologue thereof in a Gram- 
negative bacterium, wherein said homologue has at least 30% homology at the amino acid or 
nucleotide level, or a functional fragment thereof. 

Claim 10 (amended) : 

A method for treatment or prevention of a condition associated with infection by a Gram- 
negative bacterium, said method comprising administering a vaccine to a person or animal in 
need thereof, wherein said vaccine comprises a peptide, or a host transformed to express said 
peptide, wherein said peptide is encoded by an operon comprising a gene selected from the group 
consisting of tatA, tatB, tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, mtd2, and 
msl to msl6, obtainable from E. coli Kl, or a homologue thereof in a Gram-negative bacterium, 
wherein said homologue has at least 30% homology at the amino acid or nucleotide level, or a 
functional fragment thereof. 

Claim 1 1 (amended) : 

The method, according to claim 10, wherein the bacterium is E. coli. 
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Please add the following new claims: 

12. The polynucleotide, according to claim 3, wherein said gene encodes a peptide 
comprising an amino acid sequence selected from the group consisting of SEQ ID NOS. 2, 5, 7, 
9, 11, 12, 13, 14, 16, 18, 19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48. 

13. The host, according to claim 4, wherein said peptide comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 
19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48. 

14. The vaccine, according to claim 5, wherein said peptide comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 
19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48. 

15. The vaccine, according to claim 6, wherein said peptide comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 
19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48. 

16. The vaccine, according to claim 15, wherein said virulence gene mutation comprises 
a virulence gene deletion in two genes, wherein one gene encodes tatA and the other encodes 
tatE. 
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17. The vaccine, according to claim 15, wherein the gene lies within a pathogenicity 

island. 



18. The method, according to claim 9, wherein said peptide comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 
19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48. 

19. The method, according to claim 9, wherein said peptide comprises an amino acid 
sequence as set forth in SEQ ID NO. 33. 

20. The method, according to claim 10, wherein said peptide comprises an amino acid 
sequence selected from the group consisting of SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 
19, 21, 23, 24, 25, 26, 28, 29, 31, 32 and 35-48. 

21. The method, according to claim 20, wherein the bacterium is E. coli. 

22. A method for treatment or prevention of a condition associated with infection by a 
Gram-negative bacterium, said method comprising administering a nucleotide to a person or 
animal in need thereof, wherein said nucleotide comprises an operon including a gene selected 
from the group consisting of tatA, tatB r tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroQ 
iroE, mtd2, and msl to msl6, obtainable from E. coli Kl, or a homologue thereof in a Gram- 
negative bacterium, wherein said homologue has at least 30% homology at the amino acid or 
nucleotide level, or a functional fragment thereof. 



G:\SH-RESP\GJE-65-prelamend.doc/DNB/la 



7 Docket No. GJE-65 

Remarks 

Claims 1-11 have been amended and new claims 12-22 have been added. 
No new matter has been added by these amendments. 

The Commissioner is hereby authorized to charge any fees under 37 CFR 1.16 or 1.17 
required by this paper to Deposit Account 19-0065. 



Respectfully Submitted 



Y 



Glenn P. Ladwig 

Patent Attorney 

Registration No. 46,853 

Phone No.: 352-375-8100 

Address: Saliwanchik, Lloyd & Saliwanchik 
2421 N.W. 41 st Street 
Suite A- 1 Gainesville, FL 32606 
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Marked-up Version of Amended Claims 

Claim 1 (amended) : 

[A] An isolated peptide encoded by an operon^ [including any of the genes identified 
herein as] wherein said operon comprises a gene selected from the group consisting of tat A, tatB, 
tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, mtd2 i and msl to [16] ms!6 , 
obtainable from E. coli Kl, or a homologue thereof in a Gram-negative bacterium, [having] 
wherein said homologue has at least 30% homology at the amino acid or nucleotide level, or a 
functional fragment thereof[, for therapeutic use]. 

Claim 2 (amended) : 

[A] The isolated peptide 2 according to claim 1, comprising [any of the amino acid 
sequences identified herein as] an amino acid sequence selected from the group consisting of 
SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 19,21,23,24, 25, 26, 28,29,31,32 and 35-48. 

Claim 3 (amended) : 

[A] An isolated polynucleotide [encoding a peptide according to claim 1 or claim 2, for 
therapeutic use] which comprises a gene selected from the group consisting of tatA, tatB, tatC, 
tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, mtdl, and msl to msl 6, obtainable from 
E. coli KL or a homologue thereof in a Gram-negative bacterium, wherein said homologue has 
at least 30% homology at the amino acid or nucleotide level or a functional fragment thereof . 
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Claim 4 (amended) : 

A host transformed to express a peptide [according to claim 1 or claim 2] encoded by an 
operon, wherein said operon comprises a gene selected from the group consisting of tat A, tatB, 
tatC, tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, mtd2, and msl to msl 6, obtainable 
from E. coli Kl, or a homologue thereof in a Gram-negative bacterium, wherein said homologue 
has at least 30% homology at the amino acid or nucleotide level, or a functional fragment 
thereof . 

Claim 5 (amended) : 

A vaccine comprising a peptide [according to claim 1 or claim 2], or the means for its 
expression , wherein said peptide is encoded by an operon, wherein said operon comprises a gene 
selected from the group consisting of tatA, tatB, tatC, tatE, mdoG, creC, recG, vssN, eckl. iroD, 
iroC, iroE, mtd2, and msl to msl 6, obtainable from E. coli Kl, or a homologue thereof in a 
Gram-negative bacterium, wherein said homologue has at least 30% homology at the amino acid 
or nucleotide level, or a functional fragment thereof . 

Claim 6 (amended) : 

A vaccine comprising a microorganism having a virulence gene mutation, wherein the 
gene is selected from the group consisting of tat A, tatB, tatC, tatE, mdoG, creC, recG, yggN, 
eckl, iroD, iroC, iroE, mtd2, and msl to msl 6, obtainable from E. coli Kl, or a homologue 
thereof in a Gram-negative bacterium, wherein said homologue has at least 30% homology at the 
amino acid or nucleotide level, or a functional fragment thereof [according to claim 1 or claim 
2]- 
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Claim 7 (amended) : 

[A] The vaccine;, according to claim 6, [having] wherein said virulence gene mutation 
comprises a virulence gene deletion in two genes, wherein one gene encodes tatA and the other 
encodes tatE. 

Claim 8 (amended) : 

[A] The vaccine., according to claim 6, wherein the gene lies within a pathogenicity 
island[, wherein the island comprises a gene identified herein]. 

Claim 9 (amended) : 

[Use of a product according to any of claims 1 to 4, or SEQ ID NO. 33,] A method for 
screening potential drugs,, or for the detection of virulence , wherein said method utilizes a 
peptide encoded by an operon, wherein said operon comprises a gene selected from the group 
consisting of tatA. tatB, tatC, tatE, mdoG, creC. recG, vssN, eckl, iroD, iroC, iroE, mtd2, and 
msl to ms!6, obtainable from E. coli Kl, or a homologue thereof in a Gram-negative bacterium, 
wherein said homologue has at least 30% homology at the amino acid or nucleotide level, or a 
functional fragment thereof . 

Claim 10 (amended) : 

[Use of a product according to any of claims 1 to 4, for the manufacture of a medicament 
for use in the] A method for treatment or prevention of a condition associated with infection by a 
Gram-negative bacterium, said method comprising administering a vaccine to a person or animal 
in need thereof, wherein said vaccine comprises a peptide, or a host transformed to express said 
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peptide, wherein said peptide is encoded by an operon comprising a gene selected from the group 
consisting of tatA, tatB, tatC. tatE, mdoG, creC, recG, yggN, eckl, iroD, iroC, iroE, mtd2, and 
msl to ms!6, obtainable from E. coli Kl, or a homologue thereof in a Gram-negative bacterium, 
wherein said homologue has at least 30% homology at the amino acid or nucleotide level or a 
functional fragment thereof . 
Claim 1 1 (amended) : 

[Use] The method, according to claim 10, wherein the bacterium is E. coli. 
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VIRULENCE GENES AND PROTEINS, AND THEIR US E^ 
Field of the Invention 

This invention relates to the identification of virulence genes and 
proteins, and their use. More particularly, it relates to their use in therapy and 
5 in screening for drugs. 

Background to the Invention 

E. coli is a member of the Enterobacteriaceae, or enteric bacteria, which 
are Gram-negative microorganisms that populate the intestinal tracts of 
animals. Other members of this bacterial family include Enterobacter, 
io Klebsiella, Salmonella, Shigella and Yersinia. Although E. coll is found normally 
in the human gastrointestinal tract, it has been implicated in human disease, 
including septicaemia, meningitis, urinary tract infection, wound infection, 
abscess formation, peritonitis and cholangitis. 

The disease states caused by E. coll are dependent upon certain 
15 virulence determinants. For example, E. coli has been implicated in neonatal 
meningitis and a major determinant of virulence has been identified as the K1 
antigen, which is a homopolymer of sialic acid. The K1 antigen may have a role 
in avoiding the host's immunological system and preventing phagocytosis. 
Summary of the Invention 
20 The present invention is based on the identification of a series of 

virulence genes in E. coli K1 , and also related organisms the products of which 
may be implicated in the pathogenicity of the organism. 

According to one aspect of the present invention, a peptide is encoded 
by an operon including any of the genes identified herein as mdoG, creC, recG, 
25 yggN, tatA, tatB, tatC, tatE, eckl, iroD, IroC, iroE, mtdl and ms1 to 16, from E. 
coli K1, or a homologue thereof in a Gram-negative bacterium, or a functional 
fragment thereof. Such a peptide is suitable for therapeutic use, e.g. when 
isolated. 

The term "functional fragments" is used herein to define a part of the 
30 gene or peptide which retains similar therapeutic utility as the whole gene or 
peptide. For example, a functional fragment of the peptide may be used as an 
antigenic determinant, useful in a vaccine or in the production of antibodies. 
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A gene fragment may be used to encode the active peptide. Alternatively, the 
gene fragment may have utility in gene therapy, targetting the wild-type gene 
in vivo to exert a therapeutic effect. 

A peptide according to the present invention may comprise any of the 
5 amino acid sequences identified herein as SEQ ID NOS. 2, 5, 7, 9, 1 1 , 12, 13, 
14, 16, 23, 24, 25, 26, 28, 31, 29, 32 and 35-48. 

The identification of these peptides as virulence determinants allows 
them to be used in a number of ways in the treatment of infection. For example, 
a host may be transformed to express a peptide according to the invention or 
10 modified to disrupt expression of the gene encoding the peptide. A vaccine 
may also comprise a peptide according to the invention, or the means for its 
expression, for the treatment of infection. In addition, a vaccine may comprise 
a microorganism having a virulence gene deletion, wherein the gene encodes 
a peptide according to the invention. 
15 According to another aspect of the invention, the peptides or genes may 

be used for screening potential antimicrobial drugs or for the detection of 
virulence. 

A further aspect of this invention is the use of any of the products 
identified herein, for the treatment or prevention of a condition associated with 
20 infection by a Gram-negative bacterium, in particular by E. coli. 
Description of the Invention 

The present invention has made use of signature-tagged mutagenesis 
(STM) (Hensel ef a/, Science, 1995;269:400-403) to screen E. coli K1 strain 
RS228 (Pluschke et al, Infection and Immunity 39:599-608) mini-Tn5 mutant 
25 bank for attenuated mutants, to identify virulence genes (and virulence 
determinants) of E. coli 

Although E. coli K1 was used as the microorganism to identify the 
virulence genes, corresponding genes in other enteric bacteria are considered 
to be within the scope of the present invention. For example, corresponding 
30 genes or encoded proteins may be found, based on sequence homology, in 
Enterobacter, Klebsiella and other genera implicated in human intestinal 
disease, including Salmonella, Shigella and Yersinia. 
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The term "virulence determinant" is used herein to define a product, e.g. 
a peptide or protein that may have a role in the maintenance of pathogenic 
bacteria. In particular, a virulence determinant is a bacterial protein or peptide 
that is implicated in the pathogenicity of the infectious or disease-causing 
5 microorganism. 

A gene that encodes a virulence determinant may be termed a "virulence 
gene". Disruption of a virulence gene by way of mutation, deletion or insertion, 
will result in a reduced level of survival of the bacteria in a host, or a general 
reduction in the pathogenicity of the microorganism. 
10 Signature-tagged mutagenesis has proved a very useful technique for 

identifying virulence genes, and their products. The technique relies on the 
ability of transposons to insert randomly into the genome of a microorganism, 
under permissive conditions. The transposons are individually marked for easy 
identification, and then introduced separately into a microorganism, resulting 
15 in disruption of the genome. Mutated microorganisms with reduced virulence 
are then detected by negative selection and the genes where insertionai 
inactivation has occurred are identified and characterised. 

A first stage in the STM process is the preparation of suitable 
transposons ortransposon-like elements. A library of different transposons are 
20 prepared, each being incorporated into a vector or plasmid to facilitate transfer 
into the microorganism. The preparation of vectors with suitable transposons 
will be apparent to a skilled person in the art and is further disclosed in WO-A- 
96/17951. For the Gram-negative bacteria, e.g. E. coli, suitable transposons 
include Tn5 and Tn10. Having prepared the transposons, mutagenesis of a 
25 bacterial strain is then carried out to create a library of individually mutated 
bacteria. 

Pools of the mutated microorganisms are then introduced into a suitable 
host. After a suitable length of time, the microorganisms are recovered from the 
host and those microorganisms that have survived in the host are identified, 
30 thereby also identifying the mutated strains that failed to survive, i.e. avirulent 
strains. Corresponding avirulent strains in a stored library are then used to 
identify the genes where insertionai inactivation occurred. Usually, the site of 
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transposon insertion is identified by isolating the DNA flanking the transposons 
insertion site, and this permits characterisation of the genes implicated in 
virulence. 

Once an aviruient microorganism has been identified, it is possible to 
determine more fully the potential role of the mutated gene in virulence, by 
infecting a suitable host animal with a lethal dose of the mutant. The survival 
time of the infected animal is compared with that of a control infected with the 
wild-type strain, and those animals surviving for longer periods than the control 
may be said to be infected with microorganisms having mutated virulence 
genes. 

Alternatively, the potential role in virulence can be investigated by 
infecting an animal host with a mixture of the wild-type and mutant bacteria. 
After a suitable period of time, bacteria are harvested from organs of the host 
animal and the ratio of wild-type and mutant bacteria determined. This ratio is 
divided by the ratio of mutant to wild-type bacteria in the inoculum, to determine 
the competitive index (CI). Mutants which have a competitive index of less than 
1 may be said to be aviruient. 

It is possible that the gene which is inactivated by the insertion of the 
transposon may not be a true virulence gene, but may be having a polar effect 
on a downstream (virulence) gene. This can be determined by further 
experimentation, placing non-polar mutations in more defined regions of the 
gene, or mutating other adjacent genes, and establishing whether or not the 
mutant is aviruient. 

Having characterised a virulence gene in E. coli, it is possible to use the 
gene sequence to establish homologies in other microorganisms. In this way 
it is possible to determine whether other microorganisms have similar virulence 
determinants. Sequence homologies may be established by searching in 
existing databases, e.g. EMBL or Genbank. 

Virulence genes are often clustered together in distinct chromosomal 
regions called pathogenicity islands. Pathogenicity islands can be recognised 
as they are usually flanked by repeat sequences, insertion elements or tRNA 
genes. Also the G+C content is normally different from the remainder of the 
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chromosome, suggesting that they were acquired by horizontal transmission 
from another organism. For example the G+C content of the E. coli K12 
genome is 52%. Any pathogenicity islands found in E. coli strains are likely to 
have a G+C content that varies from this average. 

The identified virulence genes are likely to be useful both in generating 
attenuated vaccine strains and as a target for antimicrobials. The same may 
be true for homoiogues in Gram-negative bacteria in general. 

For the purpose of this invention, the appropriate degree of homology is 
typically at least 30%, preferably at least 50%, 60% or 70%, and more 
preferably at least 80% or 90% (at the amino acid or nucleotide level). 

Proteins according to the invention may be purified and isolated by 
methods known in the art. In particular, having identified the gene sequence, 
it will be possible to use recombinant techniques to express the genes in a 
suitable host. Active fragments and homoiogues can be identified and may be 
useful in therapy. For example, the proteins or their active fragments may be 
used as antigenic determinants in a vaccine, to elicit an immune response. 
They may also be used in the preparation of antibodies, for passive 
immunisation, or diagnostic applications. Suitable antibodies include 
monoclonal antibodies, or fragments thereof, including single chain fv 
fragments. Methods for the preparation of antibodies will be apparent to those 
skilled in the art. 

The preparation of vaccines based on attenuated microorganisms is 
known to those skilled in the art. Vaccine compositions can be formulated with 
suitable carriers or adjuvants, e.g. alum, as necessary or desired, and used in 
therapy, to provide effective immunisation against E. coli or other Gram- 
negative bacteria. The preparation of vaccine formulations will be apparent to 
the skilled person. 

More generally, and as is well known to those skilled in the art, a suitable 
amount of an active component of the invention can be selected, for therapeutic 
use, as can suitable carriers or excipients, and routes of administration. These 
factors will be chosen or determined according to known criteria such as the 
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nature/severity of the condition to be treated, the type or health of the subject 
etc. 

The following Examples illustrate the invention. For the Examples, STM 
was used to screen an E. coli K1 mini-Tn5 mutant bank for attenuated mutants, 
using a mouse model of systemic infection. The basic procedure followed that 
disclosed in Hensel et al, supra. E. coli K1 containing a mini-Tn5 insertion 
within a virulence gene was not recovered from mice inoculated with a mixed 
population of mutants, and is therefore likely to be attenuated. 

The DNA region flanking either side of the mini-Tn5 insertion was cloned 
by inverse PCR or by rescue of a kanamycin-resistance marker. In the latter 
case, chromosomal DNA from the STM-derived mutant was digested with 
restriction enzymes, ligated into the plasmid pUC19, and kanamycin-resistant 
clones selected after transformation into competent E. coli K12 cells. 
Subsequent cloning and sequencing was then performed and the gene 
sequences compared using sequences in publicly available sequence 
databases (EMBL) to help characterise the putative gene products. 
Example 1 

In a first mutant, two fragments of cloned DNA were sequenced. The 
nucleotide sequences are shown as SEQ ID NO. 1 and SEQ ID NO. 3 and a 
translated region of the DNA from SEQ ID NO. 1 is shown as SEQ ID NO. 2. 
SEQ ID NO. 1 shows 99.8% identity to the mdoGH region from E. coli K12 
(EMBL database accession number AE000206) from nucleotides 2577 to 6908. 
This DNA fragment encodes the 5'-part of the ymdD gene, the entire mdoG 
gene and the 5'-part of the mdoH gene. The product of the mdoG gene is of 
unknown function, but is believed to be involved in the biosynthesis of 
membrane-derived oligosaccharides. 

SEQ ID NO. 3 shows 98.3% identity to the 3-part of the mdoH gene and 
downstream gene sequences from E. coli K1 2 (nucleotides 7 1 87 to 7760). SEQ 
ID NO. 2 shows 99.6% identity to the mdoG protein from E. coli K12 (Swiss Prot 
accession number P33136) at amino acid 1 to 511. 

The novel gene was tested for attenuation of virulence, using mixed 
infections, in a murine model of systemic infection (Achtman et al. , Infection and 
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Immunity, 1983; Vol. 39:315-335), and shown to be attenuated with a 
competitive index (CI) of 0.38. This confirms that the attenuation of the original 
transposon mutant is likely to be due to the disruption of the mdoG gene. 

Polar and a non-polar deletion mutants of mdoG were constructed. The 
mdoG gene and flanking regions were amplified by PCR with oligonucleotides 
5'-TGCTCTAGAG CC ATTACTCAGAATG GG-3' (SEQ ID NO. 49) and 5'- 
CGCGAGCTCGACGACTGAATGATCCC-3' (SEQ ID NO. 50). The product was 
cloned into pUC19. A PCR product containing 5'- and 3'-terminal fragments of 
mdoG and the entire pUC1 9 sequence was then amplified by inverse PCR with 
the oligonucleotides 5'-TCCCCCGGGTACTGCAGCACTCAACC-3' (SEQ ID 
NO. 51) and 5'-GATCCCGGGACCACTGAAATGCGTGC-3' (SEQ ID NO. 52). 
A non-poiar kanamycin resistance cassette (aphT) was inserted in both 
orientations between the mdoG sequences to give a polar and a non-polar 
construct. The mdoG::aph Tfusions were then transferred to the suicide vector 
pCDV442. The chromosomal copy of the mdoG was mutated by allelic transfer 
after conjugation of the pCDV442 constructs into wild type E. coli K1 . 

The contructed mutants were tested for attenuation of virulence in a 
murine model of systemic infection (Achtman et ai., supra). Both the polar and 
the non-polar constructs were attenuated in virulence, with competitive indices 
of 0.37 and 0.35, respectively (mean CI from three mice each). This confirms 
that the attenuation of the original transposon mutant is likely to be due to the 
disruption of the mdoG gene. 
Example 2 

A second mutant was identified with a virulence gene having the 
nucleotide sequence shown in SEQ ID NO. 4 and the translated amino acid 
sequence shown as SEQ ID NO. 5. The mini-Tn5 transposon inserted at 
nucleotide 581 (SEQ ID NO. 4) and at amino acid 187 (SEQ ID NO. 5). 

These sequences show 97.9% identity to the creC gene of E. coJi K12 
(EMBL and Genbank accession numbers M13608, AE000510 and U14003). 

The creC protein from E. coli K12 belongs to the protein family of 
histidine kinases as well as to a protein family consisting of proteins containing 
a signal domain. 
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The novel gene was tested for attenuation of virulence (Achtman ef al, 
supra.), and shown to be attenuated with a competitive index of 0.09. 

As the E. coli K12 creC gene is transcribed as part of an operon with the 
creD gene, it is possible that this attenuation is due to a polar effect on a 
presumed E. co// K1 creD gene. 
Example 3 

A third mutant had a nucleotide sequence shown as SEQ ID NO. 6 
immediately following the mini-Tn5. A translation of this sequence is shown as 
SEQ ID NO. 7. 

The nucleotide sequence shows 93.7% identity to the recG gene of E. 
coli K1 2, at nucleotides 5-1 46 (EMBL and Genbank accession numbers P24230 
and M64367). This demonstrates that the disrupted gene is at least partially 
identical to the recG gene of E. coli K12. The recG gene of E. coli K1 2 encodes 
a 76.4kD protein which functions as ATP-dependent DNA heiicase, and plays 
a critical role in DNA repair. 

In tests for attenuation, the competitive index was shown to be 0.48. The 
recG gene is transcribed as the terminal gene of an operon, and it is therefore 
unlikely that this attenuation is due to a polar effect on another E. coli K1 gene. 
Example 4 

A fourth mutant had a transposon inserted within the nucleotide 
sequence shown as SEQ ID NO. 8, with a translation product shown as SEQ 
ID NO. 9. 

The mini-Tn5 transposon inserted at nucleotide 359 and amino acid 80. 

These sequences show 98.5% sequence identity to the yggN gene of E. 
coli K12 (EMBL accession number AE000378) at nucleotides 339-1054, and 
99.6% identity at the amino acid level. 

Although the sequence of the yggN gene is known, the function of its 
encoded protein has not yet been determined. 

The novel gene was tested for attenuation of virulence, and shown to be 
attenuated with a competitive index of 0.43. 
Example 5 
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Several mutants were also found with a transposon insertion within the 
same region. Cloning and sequencing the region revealed a nucleotide 
sequence shown as SEQ ID NO. 10. This sequence has homology with the 
tatABCD operon of E. coii K12 (EMBL and Genbank accession numbers 
AJ005830, AE000459 and AE000167). This operon encodes proteins of 
predicted mass 9.6 kD, 18.4 kD, 28.9 kD and 29.5 kD, which function as 
components of a Sec-independent protein export pathway. The pathway 
permits translocation of fully folded proteins to the periplasm through a gated 
pore, after the attachment of co-factors in the cytoplasm. 

Translation of the nucleotide sequence revealed a protein corresponding 
to tatA (SEQ ID NO. 11), a sequence corresponding to tatB (SEQ ID NO. 12), 
a sequence corresponding to tatC (SEQ ID NO. 13) and a sequence 
corresponding to tatD (SEQ ID NO. 14). 

The mini-Tn5 transposons in the mutants identified by STM are located 
at nucleotides 1429 and 2226 of SEQ ID NO. 10. These transposon insertions 
disrupt the tatB protein sequence at amino acid 50 and the tatC protein 
sequence at amino acid 143. 

The tatB and tatC genes were tested for attenuation of virulence and 
were shown to be attenuated with competitive indices of 0.0012 and 0.0039, 
respectively. These genes were also attenuated in virulence when tested in 
single infections in the same model of systemic infection. 
Example 6 

A further mutant was insertionally inactivated within a region 
corresponding to the tatE gene of E. col's K12, shown as SEQ ID NO. 15. A 
translation of the sequence as shown as SEQ ID NO. 16. The tatE gene shows 
98% identity to that of the E. coll K12 gene (accession number AE000167) at 
nucleotides 6719-7306. 

To establish whether the tatA, tatD and tatE genes are required for 
virulence, non-polar deletion mutations were constructed in each. The regions 
of DNA flanking either side of the tatA, tatD and tatE genes were amplified with 
the following primers: 
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tatA 

5'-TCG TCT AGA GAT GAT GGT GAT GGA GCG-3' (SEQ ID NO. 53) 
5'-GAA CTG CAG CCA AAT ACT GAT ACC ACC C-3' (SEQ ID NO. 54) 
5*-GAA CTG CAG GCT AAA ACA GAA GAC GCG-3' (SEQ ID NO. 55) 
5'-CAT GCA TGC ACT CCA TAT GAC AAC CGC-3' (SEQ ID NO. 56) 

Primers SEQ ID NO. 53 and SEQ ID NO. 54 were used to amplify DNA 
sequences upstream of tatA, Primers SEQ ID NO. 55 and SEQ ID NO. 56 were 
used to amplify DNA sequences downstream of tatA. 

tatD 

5'-TCG TCT AGA ATG AAG CTG CGC ATG AGG-3' (SEQ ID NO. 57) 

5'-CAA CTG CAG TCG CAA ATT GCG AAC TGG-3' (SEQ ID NO. 58) 

5'-CAA CTG CAG ACC GCA ACT TTT CGA CGC-3' (SEQ ID NO. 59) 

5'-CAT GCA TGC CAG TGA GCC ATT GTT CCC-3' (SEQ ID NO. 60) 

Primers SEQ ID NO. 57 and SEQ ID NO. 58 were used to amplify DNA 
sequences upstream of tatD, Primers SEQ ID NO. 59 and SEQ ID NO. 60 were 
used to amplify DNA sequences downstream of tatD. 

tatE 

5'-TGC TCT AGA TAC GAC TCT GAC AGG AGG-3' 
5'-TCA GAT ATC AAC TAC CAG CAG TTT GG-3* 
5'-TCA GAT ATC CAT AAA GAG TGA CGT GGC-3' 
5'-TGC TCT AGA AAA CGT GGC AAC AGA GCG-3 

Primers SEQ ID NO. 61 and SEQ ID NO. 62 were used to amplify DNA 
sequences upstream of tatE, Primers SEQ ID NO. 63 and SEQ ID NO. 64 were 
used to amplify DNA sequences downstream of tatE. 



(SEQ ID NO. 61) 
(SEQ ID NO. 62) 
(SEQ ID NO. 63) 
(SEQ ID NO. 64) 
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After cloning these flanking DNA fragments into pUC19, a non-polar 
ap/?7kanamycin resistance cassette (Galan era/, J. Bacterid, 1992; 174:4338- 
4349) was inserted between the flanking DNA fragments to replace the tatA, 
tatD and tatE genes. These DNA fragments were then transferred to the suicide 
vector pCVD442 (Blomfield ef. a/, Moi. Micro., 1991;5:1447-1457). The 
chromosomal copies of the E. coli K1 tatA, tatD and tatE genes were then 
mutated by allelic transfer after conjugation of the pCVD442 constructs into wild 
typeE. coli K1 . 

Disruptions of the tatA, tatD and tatE genes have been tested for 
attenuation of virulence (Achtman ef a/., supra). 

None of the genes was attenuated when deleted in isolation. The genes 
may still play a role in virulence, and to test this, mutants were prepared with 
deletions in both tatA and tatE genes. The double mutant was tested for 
attenuation in virulence using mixed infections with the wild-type strain and 
shown to be attenuated with a competitive index of 0.0017. It seems therefore 
that the tatA, tatD and tatE genes may be used in combination to create 
avirulent microorganisms. 

Given the similarity of the E. coli K1 tatABCD genes to predicted 
tatABCD genes present in the S. typhimurium genome and Neisseria 
meningitidis genome it seemed likely that the tat system may also be required 
for virulence in these, and other, organisms. A deletion in the S. typhimurium 
tatC gene (SEQ ID NO. 17) was constructed by amplifying the DNA flanking 
either side of the tatC gene with the following primers: 

5'-TGC TCT AGA AGG CGT TGT CGA TCC TG-3' (SEQ ID NO. 65) 

5'-GAA CTG CAG GAA AAG GCC GAG CAG ACT G-3' (SEQ ID NO. 66) 

5'-GAA CTG CAG TAC AGC CAT GTT TAC GGT-3' (SEQ ID NO. 67) 



5'-CAT GCA TGC GGT GTA CGA CAG TTT GCG-3' (SEQ ID NO. 68) 
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Primers SEQ ID NO. 65 and SEQ ID NO. 66 were used to amplify DNA 
sequences downstream of the S. typhimurium tatC gene, Primers SEQ ID NO. 
67 and SEQ ID NO. 68 were used to amplify DNA sequences upstream of the 
S. typhimurium tatC gene. 

The encoded amino acid sequences for two regions of thetatC gene are 
shown as SEQ ID NO. 18 and SEQ ID NO. 19. 

After cloning these flanking DNA fragments into pUC19, a non-polar 
kanamycin resistance cassette (aphT) was inserted between the flanking DNA 
fragments to replace the S. typhimurium tatC gene. This DNA fragment was 
then transferred to the suicide vector pCVD442. The chromosomal copy of the 
S. typhimurium tatC gene was then mutated by allelic transfer after conjugation 
of the pCVD442 construct into wild type S. typhimurium strains TML and 
SL1344. 

The disrupted S. typhimurium tatC gene was tested for attenuation of 
virulence, using mixed and single infections in a murine model of systemic 
infection. For mixed infections, 6-7 week old balbC mice were inoculated 
intraperitoneal^ with 10 4 bacterial cells. Competitive indices were calculated 
after comparing the numbers of mutant and wild-type bacteria present in 
spleens after 3 days. For single infections, mice were inoculated either 
intraperitoneally or orally with varying doses and mouse survival monitored for 
17 days. The strains were attenuated in virulence, the competitive indices of 
the SL1344 tatC and TML tatC deletion strains being 0.078 and 0.098, 
respectively. 

In single infections, mouse survival was extended compared to the wild- 
type controls. 

Sequence homology was also demonstrated with the tat sequence from 
Neisseria meningitidis. The gene sequence from N. meningitidis is shown as 
SEQ ID NO. 20 and the encoded amino acid sequence for tatC is shown as 
SEQ ID NO. 21. 

To test for virulence, a deletion mutant was created using the following 
primers: 
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5'-TGCTCTAGACACATCATGGGCACACC-3' (SEQ ID NO. 69) 

5'-GAACTGCAGAACCGTCCACATCAGGCG-3' (SEQ ID NO. 70) 

5'-GAACTGCAGACCCTGCTTGCCATTCCG-3' (SEQ ID NO. 71 ) 

5'-GAACTGCAGACCCTGCTTGCCATTCCG-3' (SEQ SD NO. 72) 

Cloning of the DNA fragments and the aphT kanamycin resistance 
cassette into pUC1 9 followed the procedure outlined above for S. typhimurium. 
The chromosomal copy of the N. meningitidis tatC 

gene was mutated by transformation of the pUC19-based constructs into wild- 
type N. meningitidis cells. 

Southern analysis of the resulting transformants indicated that all the 
transformants were merodiploids and contained both the wild-type and mutated 
copies of the tatC gene. This indicates that there is some selection against the 
isolation of mutants in which the tatC gene has been deleted. 

Further studies on polar and non-polar constructs showed that 
transformants did not grow on selective media. This suggests that the N. 
meningitidis tatC gene is essential for the in vitro growth of this organism. 
Example 7 

A further mutant was identified with a transposon insertion within a 
nucleotide sequence identified herein as SEQ ID NO. 22, at nucleotide 3981 . 
The sequence defined herein as ec/rt, shows sequence homology to several 
Group 1 glycosyltransferasesfrom a number of bacteria. Sequence homology 
was also shown to the gnd gene of E. coli K12 (at nucleotides 4197-4604 of 
SEQ ID NO. 22). 

The translation of the E. coli eckl gene is shown as SEQ ID NO. 26. 
The gene has been tested for attenuation of virulence, as described above, and 
is shown to be attenuated with a competitive index of 0.025. 

Several open reading frames (ORF) were also identified from the DNA 
sequence (SEQ ID NO. 22). The first of these is defined herein as MS1 and a 
translation product shown as SEQ ID NO. 25. The amino acid sequence is 
shown to have 50.3% identity to a putative glycosyl transferase from £. coli 
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serotype 01 1 1 (TrEMBL database accession number AAD46732). The amino 
acid sequence also shows homology with the eckl protein from E. coli K1 and 
also the TrsE protein from Yersinia entercoiitica (TrEMBL database accession 
number Q56917). 

A second open reading frame identified herein as MS2 had the gene 
sequence shown as SEQ ID NO. 24. This shows sequence homology to the 
putative glycosyl transferase TrsC from Yersinia entercoiitica (TrRMBL 
database accession number Q5691 5), and also the glycosyl transferase WbnA 
from E. coli serotype 01 13 (TrEMBL database accession number AAD50485). 

A third open reading frame encodes a product identified herein as MS3 
(SEQ ID NO. 23). The amino acid sequence shows 30.2% identity to a 
rhamnosyltransferase from Streptoccus mutans. 

The gene sequence shown as SEQ ID NO. 22 may be at least part of a 
pathogenicity island, with multiple virulence genes being positioned in a cluster 
on the microorganism's genome. 
Example 8 

A further mutant was identified having a transposon insertion within the 
iroCDE operon. The nucleotide sequences flanking either side of the mini-Tn5 
insertion are shown as SEQ ID NO. 27 and SEQ ID NO. 30. 

The mini-Tn5 transposon is inserted at nucleotide 1272 of SEQ ID NO. 
27 and at nucleotide 1 of SEQ ID NO. 30 t and interrupts the iroD gene. The N- 
terminal region of iroD is shown as SEQ ID NO. 29, and the C-terminal region 
is shown as SEQ ID NO. 31. 

In addition to /roD, the gene shown as SEQ ID NO. 27 encodes a partial 
peptide with the amino acid sequence shown as SEQ ID NO. 28. This amino 
acid sequence shows 70.9% identity to the putative ATP binding cassette 
transporter iroC from Salmonella typhi. 

The gene sequence shown as SEQ ID NO. 30 includes an open reading 
frame that encodes a peptide with the amino acid sequence shown as SEQ ID 
NO. 32 and this has sequence homology to the iroE protein from Salmonella 
typhi. 
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Testing the genes in a model for attenuation of virulence, as described 
above, showed that the iroD gene was attenuated with a competitive index of 
0.107. The mini-Tn5 mutation in the iroD gene has been reintroduced into the 
wild-type E. coli K1 strain by P1 transduction. The resulting transductant is also 
attenuated in virulence with a competitive index of 0.1 . This indicates that the 
attenuated phenotype is linked to the insertion within iroD. However, it is 
possible that the attenuation is due to a polar effect on the E. coli K1 iroE gene. 
Example 9 

A further mutant was identified with a transposon insertion within the 
nucleotide sequence shown as SEQ ID NO. 33. The transposon is inserted at 
nucleotide 2264 of SEQ ID NO. 33. The nucleotide sequence shows sequence 
homology to the aslA/hemY region of E. coli K12 (EMBL accession number 
AE000456). The aslA encodes an arylsulfatase homologue whereas hem Vis 
involved in the biosynthesis of protoheme IX. This demonstrates that the 
disrupted region is at least partially identical to the aslA/hemY region of E. coli 
K12. 

The transposon is inserted at nucleotide 2264 of SEQ ID NO. 33. This 
insertion site is 216 nucleotides downstream from the stop codon of the hemY 
gene and 472 nucleotides upstream from the start codon of the aslA gene. 

The novel region has been tested for attenuation of virulence, as 
described above, and shown to be attenuated with a competitive index of 0.033. 
The mini-Tn5 mutation in this region has been reintroduced into the wild-type 
E. co// K1 strain by P1 transduction. The resulting transductant is also 
attenuated in virulence with a competitive index of 0.008. This indicates that 
the attenuated phenotype is linked to the transposon insertion in this region. 
However, polar and non-polar deletion mutants of as/A were constructed and 
tested for attenuation of virulence as described above. 

Neither the polar nor the non-polar mutants were attenuated in virulence 
and this demonstrates that the attenuation of the original transposon mutant is 
not due to a polar effect on the as/A gene. This indicates that the transposon 
is disrupting some other function encoded within the intergenic region between 
as/A and hemY. For example there could be some untranslated RNA molecule, 
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such as a regulatory RNA similar to oxyS (Altuvia ef a/., Cell, 1 997;90:43-53), 
encoded within this region. Alternatively the transposon could be disrupting 
some DNA structure that may, for example, be involved in DNA replication. 
This DNA region is also present in the pathogen Salmonella typhimurium 
suggesting that it may be important for pathogenicity in other organisms. This 
region (SEQ ID NO. 33) may be used as a target, to identify anti-microbia! 
drugs. 
Example 10 

A further mutant was identified and the DNA region flanking either side 
of the mini-Tn5 insertion was cloned and had the nucleotide sequence shown 
as SEQ ID NO. 34. This nucleotide sequence has homology with the mtd2 
gene of Herpetosiphon auraniiacus (EMBL accession number P25265), with the 
mtd2 gene product functioning as a cytosine-specific methyltransferase. The 
mtd2 gene is not found in the E. coli K12 genome and may represent a 
pathogenicity island. 

The mini-Tn5 transposon insertions were located at nucleotides 4773 
and 3764 of SEQ ID NO. 34 and were shown to interrupt the mtd2 gene. 

The amino acid sequence of the mtd2 gene is shown as SEQ ID NO. 43. 

The E. coli K1 mtd2 gene was tested for attenuation of virulence, as 
described above, and shown to be attenuated with a competitive index of 0.073. 

In addition to the mtd2 gene, a series of open reading frames were also 
identified with translation products identified herein as MS4 to MS16, SEQ ID 
NOS. 48-44 and 42-35, respectively. As the open reading frames are located 
in a potential pathogenicity island, mutations in these genes may also result in 
attenuation in virulence. Further, since it is known that E. coli and other 
bacteria may encode peptides in different forms in the nucleotide sequence, the 
coding regions of some of these proteins may overlap. In addition, any 
aminoacid sequence shown starting with Val may in fact start with Met. 
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CLAIMS 

1 . A peptide encoded by an operon including any of the genes identified 
herein as tatA, tatB, tatC, tatE, mdoG, creC, recG, yggN t eckl, iroD, iroC, iroE, 
mtd2 and ms1 to 16, obtainable from E, coli K1, or a homologue thereof in a 

5 Gram-negative bacterium, having at least 30% homology at the amino acid or 
nucleotide level, or a functional fragment thereof, for therapeutic use. 

2. A peptide according to claim 1, comprising any of the amino acid 
sequences identified herein as SEQ ID NOS. 2, 5, 7, 9, 11, 12, 13, 14, 16, 18, 
1 9, 21 , 23, 24! 25, 26, 28, 29, 31 , 32 and 35^8. 

10 3. A polynucleotide encoding a peptide according to claim 1 or claim 2, for 
therapeutic use. 

4. A host transformed to express a peptide according to claim 1 or claim 2. 

5. A vaccine comprising a peptide according to claim 1 or claim 2, or the . 
means for its expression. 

15 6, A vaccine comprising a microorganism having a virulence gene mutation, 
wherein the gene encodes a peptide according to claim 1 or claim 2. 

7. A vaccine according to claim 6, having a virulence gene deletion in two 
genes, wherein one gene encodes tatA and the other encodes tatE. 

8. A vaccine according to claim 6, wherein the gene lies within a 
20 pathogenicity island, wherein the island comprises a gene identified herein. 

9. Use of a product according to any of claims 1 to 4, or SEQ ID NO. 33, for 
screening potential drugs or for the detection of virulence. 

1 0. Use of a product according to any of claims 1 to 4, for the manufacture 
of a medicament for use in the treatment or prevention of a condition associated 

25 with infection by a Gram-negative bacterium. 

11. Use according to claim 1 0, wherein the bacterium is E. coli. 
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Abstract of the Disclosure 
The present invention is based on the identification of a series of virulence genes in 
E. coli Kl, the products of which may be implicated in the pathogenicity of the organisms. 
The identification of the genes allows them, or their expressed products, to be used in a 
number of ways to treat infection. 
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SEQUENCE LISTING 
<110> Microscience Limited 

<120> VIRULENCE GENES AND PROTEINS, AND THEIR USE 

<130> REP05921WO 

<140> 
<141> 

<160> 72 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 4333 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1017) . . (2549) 
<400> 1 

ccattactca gaatgggcgg atacacaata 
atgccgaggc acaaaaaaat caccgatagt 
tcagaatttt ctaaattatt tctgatacgt 
accactaaca ccagtaaaaa ccacaggtgt 
ttgtcataca atgacagtcc aggccaactt 
ataaattgcg gcaatgtcag tagggggatg 
acacgtactt tccaccattt tttcaaggga 
ccggatataa cgaaaaatac ctgcatgcgg 
agccataatg acggttcggc gctattcaca 
atatgaaaag ggatccctaa caacatcagc 
cgttgcgcgg gtactgggtt catatatggt 
tgtcgggtta tgcctttagg cttgttgcca 
1 



aaaattgttc ttcttattac cgcataaccg 60 
tttaccatcg agaatttttt attcgtttta 120 
ttgaatatcc agacgcacag cgtcgtcatg 180 
gatattaatt cccaggccaa cgtattatat 240 
tccgctttcc ctttgacgta ttgcagcata 300 
gctgttaaca tcgggatacc tacacgttcg 360 
tagcgtaaaa aaagcatgta ggaaaagtac 420 
aacgagtgga tgaagtcatt aaaaagggtc 480 
tgccatgtat ggctcgaata gattaaagaa 540 
caggcgcgga tggagtcgag gaaatattca 600 
taactaatct cggatttttc gtcttatccc 660 
tagcgacacc gacctgaccg cgccaggcgc 720 
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aggcttcaag gtttttatgc atagcatcat cgctaccact aaccagaatg gaagcgtctg 780 



taagacggtt gataaataaa tttgctggca 
taggagaagc acggaaagtg aaaacggttg 
gctgaacgat accgggattc tgttgtcgga 
tcgatataag cacacaaagg gggaagtgct 



aaccctacac gaagtcgatg cttctgtctt 840 

caatcaggtg cttaatccat gagccagtgt 900 

atggcaggtt atccattaaa atagatcgga 960 

tactaattat gaaacataaa ctacaa atg 1019 
Met 



atg aaa atg cgt tgg ttg agt get gca gta atg tta acc ctg tat aca 1067 
Met Lys Met Arg Trp Leu Ser Ala Ala Val Met Leu Thr Leu Tyr Thr 
5 10 15 



tct tea age tgg get ttc agt att gat gat gtc gca aag caa get caa 1115 
Ser Ser Ser Trp Ala Phe Ser lie Asp Asp Val Ala Lys Gin Ala Gin 
20 25 30 

tec tta gee ggg aaa ggc tat gag gcg ccc aaa age aac ttg ccc tec 1163 
Ser Leu Ala Gly Lys Gly Tyr Glu Ala Pro Lys Ser Asn Leu Pro Ser 
35 40 45 



gtt ttc cgc gat atg aaa tac gcg gac 
Val Phe Arg Asp Met Lys Tyr Ala Asp 
50 55 

cat gac aaa gcg tac tgg aac aat ctg 
His Asp Lys Ala Tyr Trp Asn Asn Leu 
70 



tat cag cag ate cag ttt aat 1211 
Tyr Gin Gin lie Gin Phe Asn 
60 65 

aag acc cca ttc aaa etc gag 1259 
Lys Thr Pro Phe Lys Leu Glu 
75 80 



ttc tac cat cag ggt atg tac ttc gat acc ccg gtc aaa ata aat gaa 1307 
Phe Tyr His Gin Gly Met Tyr Phe Asp Thr Pro Val Lys lie Asn Glu 
85 90 95 



gtg act gec acc gca gtc aaa cga ate aaa tac age ccg gat tat ttc 1355 
Val Thr Ala Thr Ala Val Lys Arg lie Lys Tyr Ser Pro Asp Tyr Phe 
100 105 110 

act ttc ggc gat gtt cag cat gac aaa gac acg gta aaa gac ctt ggt 1403 
Thr Phe Gly Asp Val Gin His Asp Lys Asp Thr Val Lys Asp Leu Gly 
115 120 125 



ttt gec ggt ttc aaa gtg ctt tac ccg ate aac age aaa gat aaa aac 1451 
Phe Ala Gly Phe Lys Val Leu Tyr Pro lie Asn Ser Lys Asp Lys Asn 
130 135 140 145 



2 



WO 00/28038 



PCT/GB99/03721 



gat gaa ate gtc age atg etc ggg gec age tat ttc cgc gtg att ggt 

Asp Glu lie Val Ser Met Leu Gly Ala Ser Tyr Phe Arg Val lie Gly 

150 155 160 

gca ggt cag gtt tat ggc ctt tct gca cgc ggc ctg gca att gat ace 

Ala Gly Gin Val Tyr Gly Leu Ser Ala Arg Gly Leu Ala He Asp Thr 

165 170 175 

gec ttg cca teg ggt gaa gaa ttt cca cgc ttc aaa gag ttc tgg ate 

Ala Leu Pro Ser Gly Glu Glu Phe Pro Arg Phe Lys Glu Phe Trp He 

180 185 190 

gag cgt cca aaa ccg act gat aaa cgt tta acc att tat gca ttg ctt 

Glu Arg Pro Lys Pro Thr Asp Lys Arg Leu Thr He Tyr Ala Leu Leu 

195 200 205 

gac teg ccg cgc gcg aca ggt get tac aaa ttc gta gtt atg cca gga 

Asp Ser Pro Arg Ala Thr Gly Ala Tyr Lys Phe Val Val Met Pro Gly 

210 215 220 225 

cgt gac acg gtt gtg gat gtg cag teg aaa ate tat ctg cgc gat aaa 

Arg Asp Thr Val Val Asp Val Gin Ser Lys He Tyr Leu Arg Asp Lys 

230 235 240 

gtc ggc aaa ctg ggg gtt gca ccg tta acc agt atg ttc ctg ttt ggg 

Val Gly Lys Leu Gly Val Ala Pro Leu Thr Ser Met Phe Leu Phe Gly 

245 250 255 

ccg aac caa ccg teg cct gca aat aac tat cgt ccg gag ttg cac gac 

Pro Asn Gin Pro Ser Pro Ala Asn Asn Tyr Arg Pro Glu Leu His Asp 

260 265 270 

tct aac ggt ctg tct ate cat get ggt aat ggc gaa tgg ate tgg cgt 

Ser Asn Gly Leu Ser He Kis Ala Gly Asn Gly Glu Trp He Trp Arg 

275 280 285 

ccg ttg aat aac ccg aaa cat tta gcg gtc age age ttc teg atg gaa 

Pro Leu Asn Asn Pro Lys His Leu Ala Val Ser Ser Phe Ser Met Glu 

290 295 300 305 

aac ccg caa ggc ttc ggt eta ttg cag cgt ggt cgt gat ttc tec cgc 

Asn Pro Gin Gly Phe Gly Leu Leu Gin Arg Gly Arg Asp Phe Ser Arg 

310 315 320 

ttt gaa gat etc gat gat cgt tac gat ctt cgt cca age gca tgg gtg 
Phe Glu Asp Leu Asp Asp Arg Tyr Asp Leu Arg Pro Ser Ala Trp Val 

325 330 335 
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act ccg aaa ggg gag tgg ggc aaa ggc age gtt gag ctg gtg gaa att 2075 
Thr Pro Lys Gly Glu Trp Gly Lys Gly Ser Val Glu Leu Val Glu lie 
340 345 350 

cca acc aac gat gaa acc aac gat aac ate gtc get tac tgg acg ccg 2123 
Pro Thr Asn Asp Glu Thr Asn Asp Asn lie Val Ala Tyr Trp Thr Pro 
355 360 365 

gat cag ctg ccg gag ccg ggt aaa gag atg aac ttt aaa tac acc ate 2171 
Asp Gin Leu Pro Glu Pro Gly Lys Glu Met Asn Phe Lys Tyr Thr lie 
370 375 380 385 

acc ttc age cgt gat gaa gac aaa ctg cat gcg cca gat aac gca tgg 2219 
Thr Phe Ser Arg Asp Glu Asp Lys Leu His Ala Pro Asp Asn Ala Trp 
390 395 400 

gtg caa caa acg cgt cgt tea acg ggg gat gtg aag cag teg aac ctg 2267 
Val Gin Gin Thr Arg Arg Ser Thr Gly Asp Val Lys Gin Ser Asn Leu 
405 410 415 

att cgc cag cct gac ggt act ate gee ttt gtg gtc gat ttt acc ggc 2315 
lie Arg Gin Pro Asp Gly Thr lie Ala Phe Val Val Asp Phe Thr Gly 
420 425 430 

get gag atg aaa aaa ctg cca gag gat acc ccg gtc aca gcg caa acc 2363 
Ala Glu Met Lys Lys Leu Pro Glu Asp Thr Pro Val Thr Ala Gin Thr 
435 440 445 

age att ggt gat aat ggt gag ata gtt gaa age acg gtg cgt tat aac 2411 
Ser lie Gly Asp Asn Gly Glu lie Val Glu Ser Thr Val Arg Tyr Asn 
450 455 460 465 

ccg gtt acc aaa ggc tgg cgt ctg gtg atg cgt gtg aaa gtg aaa gat 2459 
Pro Val Thr Lys Gly Trp Arg Leu Val Met Arg Val Lys Val Lys Asp 
470 475 480 

gec aag aaa acc act gaa atg cgt get gcg ctg gtg aat gee gat cag 2507 
Ala Lys Lys Thr Thr Glu Met Arg Ala Ala Leu Val Asn Ala Asp Gin 
485 490 495 

acg ttg agt gaa acc tgg age tac cag tta cct gee aat gaa 2549 
Thr Leu Ser Glu Thr Trp Ser Tyr Gin Leu Pro Ala Asn Glu 
500 505 510 

taagacaact gagtacattg acgcaatgcc catcgccgca agegagaaag cggcattgcc 2609 

gaagactgat atccgcgccg ttcatcaggc getggatgee gaacaccgca cctgggcgcg 2669 
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ggaggatgac tccccgcaag gctcggtaaa 
acttgctgat ggacagttaa ttaaagacga 
agaagtaaaa cgctcctcga tgtttcccga 
ctgggatcgc ctgcgtggac gcgatgtgac 
agagcaggag agtgagcaaa agtggcgtac 
gatcctgacg ctcgcgcaaa ctgttgtcgc 
tcaggggtgg gcgctgatta atcctatgga 
tatgcagctt ctgccttata tgctgcaaac 
ctgttgggtg tccgccggat tctggaccgg 
gtcgcgataa atacagtata tctgcgtcaa 
atcgcacggc gttgatcatg cctatctgta 
tgcgtgcaac gtgggaatca gtaaaagcca 
ttcttagtga cagttataac ccggatatct 
ttatcgctga agtcggtggc gaaggtcaga 
agcgtaaaag cggtaatatc gatgacttct 
tggtggtgct ggatgctgac tcggtaatga 
tgatggaagc caacccgaac gccgggatca 
atacgctgta tgcgcgctgt cagcagttcg 
ccggtttgca cttctggcaa cttggcgagt 
gcgtgaaacc gtttatcgag cactgtgcac 
ccggttcaat cctgtcacat gacttcgtgg 
gggtctggat tgcttacgat ctcccgggtt 
atgagctaaa acgtgaccgc cgctggtgcc 
tggtgaaggg tatgcacccg gttcaccgtg 



ggcgcgtctg gaacaagcct ggccagattc 2729 
cgaagggcgc gatcagctaa aggcgatgcc 2789 
cccgtggcgt accaacccgg taggccgttt 2849 
gccgcgctat ctggctcgtt tgaccaaaga 2909 
cgtcggtacc atccgccgtt acattctgtt 2969 
gacctggtat atgaagacca ttcttcctta 3029 
tatggttggt caggatgtgt gggtttcctt 3089 
cggtatcctg atcctctttg cggtactgtt 3149 
cgttgatggg cttcctgcaa ctgcttattg 3209 
cagttggcga tgaaccatta aacccggagc 3269 
acgaagacgt gaaccgtgtt tttgctggct 3329 
ccgggaatgc caaacatttt gatgtctaca 3389 
gcgtcgcaga gcaaaaagcc tggatggagc 3449 
ttttctatcg ccgccgccgc cgtcgcgtga 3509 
gccgtcgctg gggcagccag tacagctaca 3569 
ccggtgattg tttgtgcggc ctggtgcgcc 3629 
ttcagtcgtc gccgaaagcg tccggcatgg 3689 
cgacccgcgt gtatgggcca ctgtttacag 3749 
cgcactactg ggggcataac gcgattatcc 3809 
tggctccgct gccgggcgaa ggttcttttg 3869 
aagcggcgtt gatgcgccgt gcaggttggg 3929 
cttatgaaga attaccgcct aacttgcttg 3989 
acggtaacct gatgaacttc cgtctgttcc 4049 
cggtgttcct gacgggcgtg atgtcttatc 4109 
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tctccgctcc gctgtggttt atgttcctcg cgctctctac tgcattgcag gtagtacatg 4169 
cgttgaccga accgcaatac ttcctgcaac cacggcagtt gttcccggta tggccgcagt 4229 
ggcgtcctga gctggcgatt gcactttttg cttcgaccat ggtgctgttg ttcctgccga 4289 
agctattgag cattttgctt atctggtgca aaggaacgaa agaa 4333 

<210> 2 
<211> 511 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Met Met Lys Met Arg Trp Leu Ser Ala Ala Val Met Leu Thr Leu Tyr 
15 io 15 

Thr Ser Ser Ser Trp Ala Phe Ser lie Asp Asp Val Ala Lys Gin Ala 



Gin Ser Leu Ala Gly Lys Gly Tyr Glu Ala Pro Lys Ser Asn Leu Pro 
35 40 45 

Ser Val Phe Arg Asp Met Lys Tyr Ala Asp Tyr Gin Gin lie Gin Phe 
50 55 60 

Asn His Asp Lys Ala Tyr Trp Asn Asn Leu Lys Thr Pro Phe Lys Leu 



Glu Phe Tyr His Gin Gly Met Tyr Phe Asp Thr Pro Val Lys lie Asn 
85 90 95 

Glu Val Thr Ala Thr Ala Val Lys Arg lie Lys Tyr Ser Pro Asp Tyr 
100 105 110 

Phe Thr Phe Gly Asp Val Gin His Asp Lys Asp Thr Val Lys Asp Leu 
115 120 125 

Gly Phe Ala Gly Phe Lys Val Leu Tyr Pro He Asn Ser Lys Asp Lys 
130 135 140 

Asn Asp Glu He Val Ser Met Leu Gly Ala Ser Tyr Phe Arg Val He 
145 150 155 160 

Gly Ala Gly Gin Val Tyr Gly Leu Ser Ala Arg Gly Leu Ala He Asp 
165 170 175 
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Thr Ala Leu Pro Ser Gly Glu Glu Phe Pro Arg Phe Lys Glu Phe Trp 
180 185 190 

lie Glu Arg Pro Lys Pro Thr Asp Lys Arg Leu Thr lie Tyr Ala Leu 
195 200 205 

Leu Asp Ser Pro Arg Ala Thr Gly Ala Tyr Lys Phe Val Val Met Pro 
210 215 220 

Gly Arg Asp Thr Val Val Asp Val Gin Ser Lys lie Tyr Leu Arg Asp 
225 230 235 240 

Lys Val Gly Lys Leu Gly Val Ala Pro Leu Thr Ser Met Phe Leu Phe 
245 250 255 

Gly Pro Asn Gin Pro Ser Pro Ala Asn Asn Tyr Arg Pro Glu Leu His 
260 265 270 

Asp Ser Asn Gly Leu Ser lie His Ala Gly Asn Gly Glu Trp lie Trp 
275 280 285 

Arg Pro Leu Asn Asn Pro Lys His Leu Ala Val Ser Ser Phe Ser Met 
290 295 300 

Glu Asn Pro Gin Gly Phe Gly Leu Leu Gin Arg Gly Arg Asp Phe Ser 
305 310 315 320 

Arg Phe Glu Asp Leu Asp Asp Arg Tyr Asp Leu Arg Pro Ser Ala Trp 
325 330 335 

Val Thr Pro Lys Gly Glu Trp Gly Lys Gly Ser Val Glu Leu Val Glu 
340 345 350 

lie Pro Thr Asn Asp Glu Thr Asn Asp Asn He Val Ala Tyr Trp Thr 
355 360 365 

Pro Asp Gin Leu Pro Glu Pro Gly Lys Glu Met Asn Phe Lys Tyr Thr 
370 375 380 

He Thr Phe Ser Arg Asp Glu Asp Lys Leu His Ala Pro Asp Asn Ala 
385 390 395 400 

Trp Val Gin Gin Thr Arg Arg Ser Thr Gly Asp Val Lys Gin Ser Asn 
405 410 415 

Leu He Arg Gin Pro Asp Gly Thr He Ala Phe Val Val Asp Phe Thr 
420 425 430 
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Gly Ala Glu Met Lys Lys Leu Pro Glu Asp Thr Pro Val Thr Ala Gin 
435 440 445 

Thr ser lie Gly Asp Asn Gly Glu He Val Glu Ser Thr Val Arg Tyr 
450 455 460 

Asn Pro Val Thr Lys Gly Trp Arg Leu Val Met Arg Val Lys Val Lys 
465 470 475 480 

Asp Ala Lys Lys Thr Thr Glu Met Arg Ala Ala Leu Val Asn Ala Asp 
485 490 495 

Gin Thr Leu Ser Glu Thr Trp Ser Tyr Gin Leu Pro Ala Asn Glu 
500 505 510 



<210> 3 
<211> 574 
<212> DNA 

<213> Escherichia coli 
<400> 3 

ttcgttgatc ctgtcaccgt ttgttcggtt atttccagcc gtgccaccgt tggtctgcga 60 

accaaacgct ggaaactgtt ccctgatccc ggaagagtat tcaccgccgc aggtgctggt 120 

tgataccgat cggttccttg agatgaatcg tcaatgctcc cttgatgatg gttttatgca 180 

cgcggtgttt aacccgtcat ttaacgctct ggcaaccgca atggcgaccg cgcgtcaccg 240 

cgccagcaag gtgctggaaa tcgcccgtga ccgccacgtt gaacaggcgc tgaacgagac 300 

gccagagaag ctgaatcgcg atcgtcgcct ggtgctgcta agcgatccgg tgacgatggc 360 

ccgtctgcat ttccgcgtct ggaattcccc ggagagatat tcttcatggg tgagttatta 420 

cgaagggata aagctcaatc cactggcatt gcgtaaaccg gatgcggctt cgcaataaaa 480 

acgtagttgc ctgatgcgct acgcttatca ggcctacatc gttcctgcaa tttattgatt 540 

ttgcaagatt ttgtaggtcg gataaggcgt tcac 574 



<210> 4 
<211> 1478 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (25).. (1449) 

<400> 4 

gggataatgc ctgaggggcc tgta atg cgt ate ggc atg egg ttg ttg ctg 51 
Met Arg He Gly Met Arg Leu Leu Leu 
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ggc tat ttt tta ctg gtg gcg gtg gcg gcc tgg ttc gta ctg get att 99 
Gly Tyr Phe Leu Leu Val Ala Val Ala Ala Trp Phe Val Leu Ala He 
10 15 20 25 

ttt gtc aaa gaa gtt aaa ccg ggc gtg cga aga gca acc gag ggg acg 147 
Phe Val Lys Glu Val Lys Pro Gly Val Arg Arg Ala Thr Glu Gly Thr 
30 35 40 

tta ate gac acc gca acg ttg ctg gcg gag ctg gcg cgt ccc gat ttg 195 
Leu lie Asp Thr Ala Thr Leu Leu Ala Glu Leu Ala Arg Pro Asp Leu 
45 50 55 

etc tct ggg gac cca acg cat ggg caa ctg gcg cag gcg ttt aat cag 243 
Leu Ser Gly Asp Pro Thr His Gly Gin Leu Ala Gin Ala Phe Asn Gin 
60 65 70 

eta caa cat cgc ccg ttt cgc gcc aat ate ggt ggc att aac aaa gtg 291 
Leu Gin His Arg Pro Phe Arg Ala Asn He Gly Gly He Asn Lys Val 
75 80 85 

cgc aac gaa tat cat gtc tat atg acc gat gcg cag ggc aaa gta ttg 339 
Arg Asn Glu Tyr His Val Tyr Met Thr Asp Ala Gin Gly Lys Val Leu 
90 95 100 105 

ttc gat teg gca aat aaa gcc gtt gga cag gat tat teg cgc tgg aat 387 
Phe Asp Ser Ala Asn Lys Ala Val Gly Gin Asp Tyr Ser Arg Trp Asn 
110 115 120 

gac gtc tgg eta acg ttg cgt ggt cag tat ggt gcg cgc age acg ttg 435 
Asp Val Trp Leu Thr Leu Arg Gly Gin Tyr Gly Ala Arg Ser Thr Leu 
125 130 135 

caa aat cct gcc gat ccc gaa agt tct gtg atg tat gtt gcc gcg ccg 483 
Gin Asn Pro Ala Asp Pro Glu Ser Ser Val Met Tyr Val Ala Ala Pro 
140 145 150 

att atg gac ggc teg egg ctt att ggc gtt ttg age gta ggc aaa ccg 531 
He Met Asp Gly Ser Arg Leu He Gly Val Leu Ser Val Gly Lys Pro 

155 160 165 

aac gcg gcg atg get ccg gtc att aag cgt age gag egg cga att tta 579 
Asn Ala Ala Met Ala Pro Val He Lys Arg Ser Glu Arg Arg He Leu 
170 175 180 185 

tgg gcc age gcc att ttg ttg ggg att gca ctg gtg att ggc gca ggc 627 
Trp Ala Ser Ala He Leu Leu Gly He Ala Leu Val He Gly Ala Gly 



WO 00/28038 



PCT/GB99/03721 



atg gtt tgg tgg ate aac cgc tct att gec agg etc act cgc tat get 675 
Met Val Trp Trp lie Asn Arg Ser lie Ala Arg Leu Thr Arg Tyr Ala 
205 210 215 

gat tec gtc act gac aat aag ccc gtt cct etc ccc gat etc ggt agt 723 
Asp Ser Val Thr Asp Asn Lys Pro Val Pro Leu Pro Asp Leu Gly Ser 
220 225 230 

age gag ttg cgt aaa etc gcg cag gcg ctg gaa agt atg cgc gtg aag 771 
Ser Glu Leu Arg Lys Leu Ala Gin Ala Leu Glu Ser Met Arg Val Lys 
235 240 245 

ctg gaa ggg aaa aac tat att gag cag tat gtt tat gcg tta act cat 819 
Leu Glu Gly Lys Asn Tyr lie Glu Gin Tyr Val Tyr Ala Leu Thr His 
250 255 260 265 

gag eta aaa age cca ctg gcg gcg att cgt ggc gcg gcg gaa att tta 867 
Glu Leu Lys Ser Pro Leu Ala Ala lie Arg Gly Ala Ala Glu lie Leu 
270 275 280 

cgc gaa ggt ccg ccg ccg gaa gtg gtg get cgt ttt acc gac aac att 915 
Arg Glu Gly Pro Pro Pro Glu Val Val Ala Arg Phe Thr Asp Asn lie 
285 290 295 

ctg acg caa aat gcg cga atg cag gca ctg gtg gaa acg tta eta cgc 963 
Leu Thr Gin Asn Ala Arg Met Gin Ala Leu Val Glu Thr Leu Leu Arg 
300 305 310 

cag gca aga ctg gag aat cgt cag gaa gtc gtt ctg act get gtt gat 1011 
Gin Ala Arg Leu Glu Asn Arg Gin Glu Val Val Leu Thr Ala Val Asp 
315 320 325 

gtg gcg gca tta ttt cgc cgc gtc age gaa gcg cgc acc gtg cag ttg 1059 
Val Ala Ala Leu Phe Arg Arg Val Ser Glu Ala Arg Thr Val Gin Leu 
330 335 340 345 

gca gaa aaa aac ate act ttg cat gtt atg cct act gag gtt aac gtt 1107 
Ala Glu Lys Asn lie Thr Leu His Val Met Pro Thr Glu Val Asn Val 
350 355 360 

get tct gaa ccg gcg tta ctg gag cag gcg ctg ggg aat tta ctg gat 1155 
Ala Ser Glu Pro Ala Leu Leu Glu Gin Ala Leu Gly Asn Leu Leu Asp 
365 370 375 

aac gec ate gat ttt act ccc gag age ggt tgc ata acg eta age gec 1203 
Asn Ala He Asp Phe Thr Pro Glu Ser Gly Cys He Thr Leu Ser Ala 
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380 385 390 

gaa gtg gat cag gaa tac gtc acc ctt aag gtg ctg gat acc ggt agt 1251 
Glu Val Asp Gin Glu Tyr Val Thr Leu Lys Val Leu Asp Thr Gly Ser 
395 400 405 



ggg att cct gac tac gcg ctg tea cgt att ttt gaa cgc ttt tac tct 1299 
Gly lie Pro Asp Tyr Ala Leu Ser Arg lie Phe Glu Arg Phe Tyr Ser 
410 415 420 425 



ttg ccg cgt gca aat ggg caa aaa age age ggt ctg ggg ttg gcg ttt 1347 
Leu Pro Arg Ala Asn Gly Gin Lys Ser Ser Gly Leu Gly Leu Ala Phe 
430 435 440 



gtc agt gag gtc gec cgt ttg ttt aac ggc gaa gtc acg ctg cgc aac 1395 
Val Ser Glu Val Ala Arg Leu Phe Asn Gly Glu Val Thr Leu Arg Asn 
445 450 455 



gtg cag gaa ggt ggc gtg ctg gec teg ctt cga ctt cac cgt cac ttc 1443 
Val Gin Glu Gly Gly Val Leu Ala Ser Leu Arg Leu His Arg His Phe 
460 465 470 



aca tag cttcaaattc ttcccacata gtcttcgta 
Thr 

475 



<210> 5 
<211> 474 
<212> PRT 

<213> Escherichia coli 
<400> 5 

Met Arg lie Gly Met Arg Leu Leu Leu Gly Tyr Phe Leu Leu Val Ala 
15 10 15 

Val Ala Ala Trp Phe Val Leu Ala lie Phe Val Lys Glu Val Lys Pro 
20 25 30 

Gly Val Arg Arg Ala Thr Glu Gly Thr Leu He Asp Thr Ala Thr Leu 
35 40 45 

Leu Ala Glu Leu Ala Arg Pro Asp Leu Leu Ser Gly Asp Pro Thr His 
50 55 60 

Gly Gin Leu Ala Gin Ala Phe Asn Gin Leu Gin His Arg Pro Phe Arg 

65 70 75 80 
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Ala Asn lie Gly Gly lie Asn Lys Val Arg Asn Glu Tyr His Val Tyr 



Met Thr Asp Ala Gin Gly Lys Val Leu Phe Asp Ser Ala Asn Lys Ala 
100 105 110 

Val Gly Gin Asp Tyr Ser Arg Trp Asn Asp Val Trp Leu Thr Leu Arg 
115 120 125 

Gly Gin Tyr Gly Ala Arg Ser Thr Leu Gin Asn Pro Ala Asp Pro Glu 
130 135 140 

Ser Ser Val Met Tyr Val Ala Ala Pro lie Met Asp Gly Ser Arg Leu 
145 150 155 160 

He Gly Val Leu Ser Val Gly Lys Pro Asn Ala Ala Met Ala Pro Val 
165 170 175 

He Lys Arg Ser Glu Arg Arg He Leu Trp Ala Ser Ala He Leu Leu 
180 185 190 

Gly He Ala Leu Val He Gly Ala Gly Met Val Trp Trp He Asn Arg 
195 200 205 

Ser He Ala Arg Leu Thr Arg Tyr Ala Asp Ser Val Thr Asp Asn Lys 
210 215 220 

Pro Val Pro Leu Pro Asp Leu Gly Ser Ser Glu Leu Arg Lys Leu Ala 
225 230 235 240 

Gin Ala Leu Glu Ser Met Arg Val Lys Leu Glu Gly Lys Asn Tyr He 
245 250 255 

Glu Gin Tyr Val Tyr Ala Leu Thr His Glu Leu Lys Ser Pro Leu Ala 
260 265 270 

Ala He Arg Gly Ala Ala Glu He Leu Arg Glu Gly Pro Pro Pro Glu 
275 280 285 

Val Val Ala Arg Phe Thr Asp Asn He Leu Thr Gin Asn Ala Arg Met 
290 295 300 

Gin Ala Leu Val Glu Thr Leu Leu Arg Gin Ala Arg Leu Glu Asn Arg 
305 310 315 320 

Gin Glu val val Leu Thr Ala Val Asp Val Ala Ala Leu Phe Arg Arg 
325 330 335 
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Val Ser Glu Ala Arg Thr Val Gin Leu Ala Glu Lys Asn He Thr Leu 
340 345 350 

His Val Met Pro Thr Glu Val Asn Val Ala Ser Glu Pro Ala Leu Leu 
355 360 365 

Glu Gin Ala Leu Gly Asn Leu Leu Asp Asn Ala He Asp Phe Thr Pro 
370 375 380 

Glu Ser Gly Cys He Thr Leu Ser Ala Glu Val Asp Gin Glu Tyr Val 
385 390 395 400 

Thr Leu Lys Val Leu Asp Thr Gly Ser Gly He Pro Asp Tyr Ala Leu 
405 410 415 

Ser Arg He Phe Glu Arg Phe Tyr Ser Leu Pro Arg Ala Asn Gly Gin 
420 425 430 

Lys Ser Ser Gly Leu Gly Leu Ala Phe Val Ser Glu Val Ala Arg Leu 
435 440 445 

Phe Asn Gly Glu Val Thr Leu Arg Asn Val Gin Glu Gly Gly Val Leu 
450 455 460 

Ala Ser Leu Arg Leu His Arg His Phe Thr 
465 470 



<210> 6 
<211> 128 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . (126) 

<400> 6 

atg aaa ggt cgc ctg tta gat get 
Met Lys Gly Arg Leu Leu Asp Ala 
1 5 

gtt ggc gca gcg ctt agt aac aag 
Val Gly Ala Ala Leu Ser Asn Lys 
20 

gta cag gat tta etc tta cac ctt 



gtc ccg etc agt tec eta acg ggc 48 
Val Pro Leu Ser Ser Leu Thr Gly 
10 15 

ctg gcg aaa ate aac ctg cat acc 96 
Leu Ala Lys He Asn Leu His Thr 
25 30 

cct ctg eg 12 8 
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<210> 7 
<211> 42 
<212> PRT 

<213> Escherichia coli 
<400> 7 

Met Lys Gly Arg Leu Leu Asp Ala Val Pro Leu Ser Ser Leu Thr Gly 
15 10 15 

Val Gly Ala Ala Leu Ser Asn Lys Leu Ala Lys lie Asn Leu His Thr 
20 25 30 

Val Gin Asp Leu Leu Leu His Leu Pro Leu 
35 40 



<210> 8 
<211> 1174 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (121) . . (837) 

<400> 8 

agatgcacga tcgagtaggc cggataaggc gtttacgccg catccagcat ggaaaacgcg 60 

cactttgtta tcaatctggg gccagcaaat gctggcctga tttgttcttg agggaagact 120 

atg atg cgc aaa atg ctg ctg gcg gca gca ctt tea gtg acg gca atg 168 
Met Met Arg Lys Met Leu Leu Ala Ala Ala Leu Ser Val Thr Ala Met 
15 10 15 

acc get cac gec gac tac cag tgc age gtc acg ccg cgt gac gat gtg 216 
Thr Ala His Ala Asp Tyr Gin Cys Ser Val Thr Pro Arg Asp Asp Val 
20 25 30 

att gtc age ccg caa acc gtg cag gtg aag ggc gaa aac ggc aat ctg 264 
He Val Ser Pro Gin Thr Val Gin Val Lys Gly Glu Asn Gly Asn Leu 
35 40 45 

gtg ate acg cca gac ggc aac gtg atg tat aac ggt aag caa tat tec 312 
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Val lie Thr Pro Asp Gly Asn Val Met Tyr Asn Gly Lys Gin Tyr Ser 



ctg aat gcc gcc cag cgc gag cag gcg aag gat tat cag get gaa eta 
Leu Asn Ala Ala Gin Arg Glu Gin Ala Lys Asp Tyr Gin Ala Glu Leu 



cgt age acc ctg ccg tgg att gat gga ggc gcg aaa age cgc gtc gag 408 

Arg Ser Thr Leu Pro Trp lie Asp Gly Gly Ala Lys Ser Arg Val Glu 
85 90 95 

aaa get cgt att gcg ctg gat aaa att ate gtt cag gag atg ggc gaa 456 

Lys Ala Arg lie Ala Leu Asp Lys He He Val Gin Glu Met Gly Glu 
100 105 no 

age age aaa atg cgc age cgt ctg acc aaa ctt gat gcg cag ctg aaa 504 

Ser Ser Lys Met Arg Ser Arg Leu Thr Lys Leu Asp Ala Gin Leu Lys 

115 120 125 

gag cag atg aac cgc att ate gaa acg cgc age gat ggc ctg acg ttt 552 

Glu Gin Met Asn Arg He He Glu Thr Arg Ser Asp Gly Leu Thr Phe 
130 135 140 

cac tat aaa gcc att gat cag gtt cgt gcc gaa ggc cag caa tta gtg 600 

His Tyr Lys Ala He Asp Gin Val Arg Ala Glu Gly Gin Gin Leu Val 
145 150 155 160 

aat cag gca atg ggc gga att tta cag gac age att aat gaa atg ggc 648 

Asn Gin Ala Met Gly Gly He Leu Gin Asp Ser He Asn Glu Met Gly 
165 170 175 

gcg aaa gcg gtg ctg aaa age ggc ggt aac cca tta cag aac gtg ctg 696 

Ala Lys Ala Val Leu Lys Ser Gly Gly Asn Pro Leu Gin Asn Val Leu 
180 185 190 

gga age ctg ggc ggc ctg caa tec tea ate caa acc gag tgg aaa aag 744 

Gly Ser Leu Gly Gly Leu Gin Ser Ser He Gin Thr Glu Trp Lys Lys 

195 200 205 

cag gaa aaa gat ttc cag cag ttt ggc aaa gat gtt tgt age cgc gtt 792 

Gin Glu Lys Asp Phe Gin Gin Phe Gly Lys Asp Val Cys Ser Arg Val 
210 215 220 

gtg act ctg gaa gat age cgc aaa gcc ctg gtc ggg aat tta aaa 837 

Val Thr Leu Glu Asp Ser Arg Lys Ala Leu Val Gly Asn Leu Lys 
225 230 235 

taatcctcta ttttaagacg gcataatact tttttatgee gtttaattet tcgttttgtt 897 
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acctgcctct aactttgtaa gggcgaattc tgcagatatc catcacactg gcggccgctc 957 
gagcatgcat ctagagggcc caattcgccc tatagtgagt cgtattacaa ttcactggcc 1017 
gtcgttttac aaccgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc 1077 
agcacatccc cctttcgcca gctggcgtaa tagcgaaaag gcccgcaccg atcgcccttc 1137 
caacagttgc gcacctgatg gccaatggac gcgcctg 117 4 



<210> 9 
<211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 9 

Met Met Arg Lys Met Leu Leu Ala Ala Ala Leu Ser Val Thr Ala Met 
15 10 15 

Thr Ala His Ala Asp Tyr Gin Cys Ser Val Thr Pro Arg Asp Asp Val 
20 25 30 

lie Val Ser Pro Gin Thr Val Gin Val Lys Gly Glu Asn Gly Asn Leu 
35 40 45 

Val He Thr Pro Asp Gly Asn Val Met Tyr Asn Gly Lys Gin Tyr Ser 
50 55 60 

Leu Asn Ala Ala Gin Arg Glu Gin Ala Lys Asp Tyr Gin Ala Glu Leu 
65 70 75 80 

Arg Ser Thr Leu Pro Trp He Asp Gly Gly Ala Lys Ser Arg Val Glu 



Lys Ala Arg He Ala Leu Asp Lys He He Val Gin Glu Met Gly Glu 
100 105 no 

Ser Ser Lys Met Arg Ser Arg Leu Thr Lys Leu Asp Ala Gin Leu Lys 
115 120 125 

Glu Gin Met Asn Arg He He Glu Thr Arg Ser Asp Gly Leu Thr Phe 
130 135 140 

His Tyr Lys Ala He Asp Gin Val Arg Ala Glu Gly Gin Gin Leu Val 
145 150 155 160 
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Asn Gin Ala Met Gly Gly He Leu Gin Asp Ser He Asn Glu Met Gly 
165 170 175 

Ala Lys Ala Val Leu Lys Ser Gly Gly Asn Pro Leu Gin Asn Val Leu 
180 185 190 

Gly Ser Leu Gly Gly Leu Gin Ser Ser He Gin Thr Glu Trp Lys Lys 
195 200 205 

Gin Glu Lys Asp Phe Gin Gin Phe Gly Lys Asp Val Cys Ser Arg Val 
210 215 220 

Val Thr Leu Glu Asp Ser Arg Lys Ala Leu Val Gly Asn Leu Lys 
225 230 235 



<210> 10 
<211> 3406 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1007) (1276) 

<220> 
<221> CDS 

<222> (1280) .. (1792) 

<220> 
<221> CDS 

<222> (1798) .. (2574) 

<220> 
<221> CDS 

<222> (2604) .. (3398) 
<400> 10 

gatgatggtg atggagcgta tttacggcat tccggtgtct gatgttgcga cgctggagaa 60 
aaacggcacc aacatgaaat tgctggcgga acgcggcgtg caggtgttct tcactcaggt 120 
ctttcgcgac agctttttcc atgctgatat gcaccctggc aacatcttcg taagctatga 180 
acacccggaa aacccgaaat atatcggcat tgattgcggg attgttggct cgctaaacaa 240 
agaagataaa cgctatctgg cggaaaactt tatcgccttc tttaatcgcg actatcgcaa 300 
17 
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agtggcagag ctacacgtcg attctggttg ggtgccacca gataccaacg ttgaagagtt 360 

cgaatttgcc attcgtacgg tctgtgaacc tatctttgag aaaccgctgg ccgaaatttc 420 

gtttggacat gtactgttaa atctgtttaa tacggcgcgt cgcttcaata tggaagtgca 48 0 

gccgcaactg gtgttactcc agaaaaccct gctctacgtc gaaggggtag gacgccagct 540 

ttatccgcaa ctcgatttat ggaaaacggc gaagcctttc ctggagtcgt ggattaaaga 600 

tcaggtcggt attcctgcgc tggtgagagc atttaaagaa aaagcgccgt tctgggtcga 660 

aaaaatgcca gaactgcctg aactggttta cgacagtttg cgccagggca agtatttaca 720 

gcatagtgtt ggtaagattg cccgcgagct tcagtcaaat catgtacgtc agggacaatt 7 80 

cgcgttattt tctcggaatt ggcgctacgt tagtatttaa gtggcacatt cttgttggtc 840 

agccgacctg aatgggggct gatgcccggc tggttaatgg caggtggtct gatcgcctgg 900 

tttgtccggt tggcgcaaaa cacgctgatt ttttcatcgc tcaaggcggg ccgtgtaacg 960 

tataatgcgg ctttgtttaa tcatcatcta ccacagagga acatgt atg ggt ggt 1015 

Met Gly Gly 
1 

ate agt att tgg cag tta ttg att att gec gtc ate gtt gta ctg ctt 1063 
lie Ser lie Trp Gin Leu Leu lie lie Ala Val lie Val Val Leu Leu 



ttt ggc acc aaa aag etc ggc tec ate ggt tec gat ctt ggt gcg teg 1111 
Phe Gly Thr Lys Lys Leu Gly Ser lie Gly Ser Asp Leu Gly Ala Ser 



ate aaa ggc ttt aaa aaa gca atg age gat gat gaa cca aag cag gat 1159 
lie Lys Gly Phe Lys Lys Ala Met Ser Asp Asp Glu Pro Lys Gin Asp 
40 45 50 

aaa acc age cag gat get gat ttt act gcg aaa act ate gec gat aag 1207 
Lys Thr Ser Gin Asp Ala Asp Phe Thr Ala Lys Thr lie Ala Asp Lys 
55 60 65 

cag gcg gat acg aat cag gaa cag get aaa ata gaa gac gcg aag cgc 1255 
Gin Ala Asp Thr Asn Gin Glu Gin Ala Lys lie Glu Asp Ala Lys Arg 



cac gat aaa gag cag gtg taa tct gtg ttt gat ate ggt ttt age gaa 1303 
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His Asp Lys Glu Gin Val Val Phe Asp lie Gly Phe Ser Glu 

85 90 95 

ctg eta ttg gtg ttc ate ate ggc etc gtc gtt ctg ggg ccg caa cga 1351 
Leu Leu Leu Val Phe lie lie Gly Leu Val Val Leu Gly Pro Gin Arg 
100 105 HO 

ctg cct gtg gcg gta aaa acg gta gcg ggc tgg att cgc gcg ttg cgt 13 99 
Leu Pro Val Ala Val Lys Thr Val Ala Gly Trp lie Arg Ala Leu Arg 
115 120 125 130 

tea ctg gcg aca acg gtg cag aac gaa ctg ace cag gag tta aaa etc 1447 
ser Leu Ala Thr Thr Val Gin Asn Glu Leu Thr Gin Glu Leu Lys Leu 
135 140 145 

cag gag ttt cag gac agt ctg aaa aag gtt gaa aag gcg age etc act 1495 
Gin Glu Phe Gin Asp Ser Leu Lys Lys Val Glu Lys Ala Ser Leu Thr 
150 155 160 

aac ctg acg ccc gaa ctg aaa gcg teg atg gat gaa tta cgc cag get 1543 
Asn Leu Thr Pro Glu Leu Lys Ala Ser Met Asp Glu Leu Arg Gin Ala 
165 170 175 

gcg gag teg atg aaa cgt tec tac gtt gca aac gat cct gaa aag gcg 1591 
Ala Glu Ser Met Lys Arg Ser Tyr Val Ala Asn Asp Pro Glu Lys Ala 
180 185 190 

age gat gaa gcg cac ace ate cat aac ccg gtg gtg aaa gac aat gaa 1639 
Ser Asp Glu Ala His Thr He His Asn Pro Val Val Lys Asp Asn Glu 
195 200 205 210 

act gcg cat gaa ggc gta acg cct get get gca caa acg cag gec agt 1687 
Thr Ala His Glu Gly Val Thr Pro Ala Ala Ala Gin Thr Gin Ala Ser 
215 220 225 

teg ccg gaa cag aag cca gaa ace acg cca gag ccg gtg gta aaa cct 1735 
Ser Pro Glu Gin Lys Pro Glu Thr Thr Pro Glu Pro Val Val Lys Pro 
230 235 240 

get gcg gac get gaa ccg aaa ace get gca cct tec cct teg teg agt 1783 
Ala Ala Asp Ala Glu Pro Lys Thr Ala Ala Pro Ser Pro Ser Ser Ser 
245 250 255 

gat aaa ccg taaac atg tct gta gaa gat act caa ccg ctt ate acg cat 1833 
Asp Lys Pro Met Ser Val Glu Asp Thr Gin Pro Leu He Thr His 

260 265 270 

ctg att gag ctg cgt aag cgt ctg ctg aac tgc att ate teg gtg ate 1881 



19 



WO 00/28038 



PCT/GB99/03721 



Leu lie Glu Leu Arg Lys Arg Leu Leu Asn Cys He He Ser Val He 
275 280 285 

gtg ata ttc ctg tgt ctg gtc tat ttc gcc aat gac ate tat cac ctg 1929 

Val He Phe Leu Cys Leu Val Tyr Phe Ala Asn Asp He Tyr His Leu 

290 295 300 305 

gta tec gcg cca ctg ate aag cag ttg ccg caa ggt tea acg atg ate 1977 

Val Ser Ala Pro Leu He Lys Gin Leu Pro Gin Gly Ser Thr Met He 
310 315 320 

gcc acc gac gtg gcc teg ccg ttc ttt acg ccg ate aag ctg acc ttt 2025 

Ala Thr Asp Val Ala Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe 
325 330 335 

atg gtg teg ctg att ctg tea gcg ccg gtg att etc tat cag gtg tgg 2073 

Met Val Ser Leu He Leu Ser Ala Pro Val He Leu Tyr Gin Val Trp 
340 345 350 

gcg ttt ate gcc cca gcg ctg tat aag cat gaa cgt cgc ctg gtg gtg 2121 

Ala Phe He Ala Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val 
355 360 365 

ccg ctg ctg gtt tec age tct ctg ctg ttt tat ate ggc atg gcg ttc 2169 

Pro Leu Leu Val Ser Ser Ser Leu Leu Phe Tyr He Gly Met Ala Phe 

370 375 380 385 

gcc tac ttt gtg gtc ttt ccg ctg gca ttt ggc ttc ctt gcc aat acc 2217 

Ala Tyr Phe Val Val Phe Pro Leu Ala Phe Gly Phe Leu Ala Asn Thr 
390 395 400 

gcg ccg gaa ggg gta cag gta tec acc gac ate gcg age tat tta age 2265 

Ala Pro Glu Gly Val Gin Val Ser Thr Asp He Ala Ser Tyr Leu Ser 
405 410 415 

ttc gtt atg gcg ctg ttt atg gcg ttt ggt gtc tec ttt gaa gtg ccg 2313 

Phe Val Met Ala Leu Phe Met Ala Phe Gly Val Ser Phe Glu Val Pro 
420 425 430 

gtg gca att gtg ctg ctg tgc tgg atg ggg att acc teg cca gaa gac 2361 

Val Ala He Val Leu Leu Cys Trp Met Gly He Thr Ser Pro Glu Asp 
435 440 445 

tta cgc aaa aaa cgc ccg tat gtg ctg gtt ggt gca ttc gtt gtc ggg 2409 

Leu Arg Lys Lys Arg Pro Tyr Val Leu Val Gly Ala Phe Val Val Gly 

450 455 460 465 

atg ttg ctg acg ccg ccg gat gtc ttc teg caa acg ctg ttg gcg ate 2457 
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Met Leu Leu Thr Pro Pro Asp Val Phe Ser Gin Thr Leu Leu Ala lie 
470 475 480 

cct atg tac tgc ctg ttt gaa ate ggt gtc ttc ttc tea cgc ttt tac 
Pro Met Tyr Cys Leu Phe Glu lie Gly Val Phe Phe Ser Arg Phe Tyr 
485 490 495 

gtt ggt aaa ggg cga aac egg gaa gag gaa aac gac get gaa gca gaa 
Val Gly Lys Gly Arg Asn Arg Glu Glu Glu Asn Asp Ala Glu Ala Glu 
500 505 510 

age gaa aaa act gaa gaa taa attcaaccgc ccgtcagggc ggttgtcat atg 
Ser Glu Lys Thr Glu Glu Met 
515 520 

gag tac agg atg ttt gat ate ggc gtt aat ttg ace agt teg caa ttt 
Glu Tyr Arg Met Phe Asp lie Gly Val Asn Leu Thr Ser Ser Gin Phe 
525 530 535 

gcg aaa gac cgt gat gat gtt gta gcg cgc get ttt gac gcg gga gtt 
Ala Lys Asp Arg Asp Asp Val Val Ala Arg Ala Phe Asp Ala Gly Val 
540 545 550 

aat ggg eta etc ate acc ggt ace aat ctg cgt gaa age cag cag gcg 
Asn Gly Leu Leu lie Thr Gly Thr Asn Leu Arg Glu Ser Gin Gin Ala 
555 560 565 

caa aag ctg gcg cgt cag tat teg tec tgt tgg tea acg gcg ggc gta 
Gin Lys Leu Ala Arg Gin Tyr Ser Ser Cys Trp Ser Thr Ala Gly Val 
570 575 580 585 

cat cct cac gac age age cag tgg caa get gtg act gaa gaa gcg att 
His Pro His Asp Ser Ser Gin Trp Gin Ala Val Thr Glu Glu Ala He 
590 595 600 

att gag ctg gee gcg cag cca gaa gtg gtg gcg att ggt gaa tgt ggt 
lie Glu Leu Ala Ala Gin Pro Glu Val Val Ala He Gly Glu Cys Gly 
605 610 615 

etc gac ttt aac cgc aac ttt teg acg ccg gaa gag cag gaa cgc get 
Leu Asp Phe Asn Arg Asn Phe Ser Thr Pro Glu Glu Gin Glu Arg Ala 
620 625 630 

ttt gtt gee cag eta cgc att gec gca gaa tta aac atg ccg gta ttt 
Phe Val Ala Gin Leu Arg He Ala Ala Glu Leu Asn Met Pro Val Phe 
635 640 645 

atg cac tgt cgc gat gee cac gag egg ttt atg aca ttg ctg gag ccg 
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Met His Cys Arg Asp Ala His Glu Arg Phe Met Thr Leu Leu Glu Pro 
650 655 660 665 

tgg ctg gat aaa ctg cct ggt gcg gtt ctt cat tgc ttt acc ggc aca 3086 

Trp Leu Asp Lys Leu Pro Gly Ala Val Leu His Cys Phe Thr Gly Thr 
670 675 680 

cgc gaa gag atg cag gcg tgc gtg gcg tgt gga att tat ate ggc att 3134 

Arg Glu Glu Met Gin Ala Cys Val Ala Cys Gly lie Tyr lie Gly lie 
685 690 695 

acc ggt tgg gtt tgc gat gaa cga cgc ggg ctg gag ctg egg gaa ttg 3182 

Thr Gly Trp Val Cys Asp Glu Arg Arg Gly Leu Glu Leu Arg Glu Leu 
700 705 710 

ttg ccg ttg att ccg gcg gag aaa ttg ctg ate gaa act gat gcg ccg 3230 

Leu Pro Leu lie Pro Ala Glu Lys Leu Leu He Glu Thr Asp Ala Pro 

715 720 725 

tat ctg etc cct cgc gat etc acg cca aag cca tea tec egg cgc aac 327 8 

Tyr Leu Leu Pro Arg Asp Leu Thr Pro Lys Pro Ser Ser Arg Arg Asn 
730 735 740 745 

gag cca gec cat ctg ccc cat att ttg caa cgt att gcg cac tgg cgt 3326 

Glu Pro Ala His Leu Pro His He Leu Gin Arg lie Ala His Trp Arg 
750 755 760 

gga gaa gat gec gca tgg ctg get gec acc acg gat gec aat gtc aaa 3374 

Gly Glu Asp Ala Ala Trp Leu Ala Ala Thr Thr Asp Ala Asn Val Lys 
765 770 775 

aca ctg ttt ggg att gcg ttt tag agtttgcg 3406 

Thr Leu Phe Gly He Ala Phe 

780 785 



<210> 11 
<211> 89 
<212> PRT 

<213> Escherichia coli 
<400> 11 

Met Gly Gly He Ser He Trp Gin Leu Leu He He Ala Val He Val 
15 10 15 

Val Leu Leu Phe Gly Thr Lys Lys Leu Gly Ser He Gly Ser Asp Leu 
20 25 30 
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Gly Ala Ser lie Lys Gly Phe Lys Lys Ala Met Ser Asp Asp Glu Pro 
35 40 45 

Lys Gin Asp Lys Thr Ser Gin Asp Ala Asp Phe Thr Ala Lys Thr lie 
50 55 60 

Ala Asp Lys Gin Ala Asp Thr Asn Gin Glu Gin Ala Lys lie Glu Asp 
65 70 75 80 

Ala Lys Arg His Asp Lys Glu Gin Val 
85 



<210> 12 
<211> 171 
<212> PRT 

<213> Escherichia coli 
<400> 12 

Val Phe Asp He Gly Phe Ser Glu Leu Leu Leu Val Phe He lie Gly 
15 10 15 

Leu Val Val Leu Gly Pro Gin Arg Leu Pro Val Ala Val Lys Thr Val 
20 25 30 

Ala Gly Trp He Arg Ala Leu Arg Ser Leu Ala Thr Thr Val Gin Asn 
35 40 45 

Glu Leu Thr Gin Glu Leu Lys Leu Gin Glu Phe Gin Asp Ser Leu Lys 
50 55 60 

Lys Val Glu Lys Ala Ser Leu Thr Asn Leu Thr Pro Glu Leu Lys Ala 
65 70 75 80 

Ser Met Asp Glu Leu Arg Gin Ala Ala Glu Ser Met Lys Arg Ser Tyr 
85 90 95 

Val Ala Asn Asp Pro Glu Lys Ala Ser Asp Glu Ala His Thr He His 
100 105 HO 

Asn Pro Val Val Lys Asp Asn Glu Thr Ala His Glu Gly Val Thr Pro 
115 120 125 

Ala Ala Ala Gin Thr Gin Ala Ser Ser Pro Glu Gin Lys Pro Glu Thr 
130 135 140 

Thr Pro Glu Pro Val Val Lys Pro Ala Ala Asp Ala Glu Pro Lys Thr 
145 150 155 160 
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Ala Ala Pro Ser Pro Ser Ser Ser Asp Lys Pro 
165 170 



<210> 13 
<211> 258 
<212> PRT 

<213> Escherichia coli 
<400> 13 

Met Ser Val Glu Asp Thr Gin Pro Leu He Thr His Leu He Glu Leu 
Arg Lys Arg Leu Leu Asn Cys He He Ser Val He Val He Phe Leu 



Cys Leu Val Tyr Phe Ala Asn Asp He Tyr His Leu Val Ser Ala Pro 
35 40 45 

Leu He Lys Gin Leu Pro Gin Gly Ser Thr Met He Ala Thr Asp Val 



Ala Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe Met Val Ser Leu 
65 70 75 80 

He Leu Ser Ala Pro Val He Leu Tyr Gin Val Trp Ala Phe He Ala 
85 90 95 

Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val 
100 105 no 

Ser Ser Ser Leu Leu Phe Tyr He Gly Met Ala Phe Ala Tyr Phe Val 
115 120 125 

Val Phe Pro Leu Ala Phe Gly Phe Leu Ala Asn Thr Ala Pro Glu Gly 
130 135 140 

Val Gin Val Ser Thr Asp He Ala Ser Tyr Leu Ser Phe Val Met Ala 
145 150 155 160 

Leu Phe Met Ala Phe Gly Val Ser Phe Glu Val Pro Val Ala He Val 
165 170 175 

Leu Leu cys Trp Met Gly He Thr Ser Pro Glu Asp Leu Arg Lys Lys 
180 185 190 

Arg Pro Tyr Val Leu Val Gly Ala Phe Val Val Gly Met Leu Leu Thr 
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195 200 205 

Pro Pro Asp Val Phe Ser Gin Thr Leu Leu Ala He Pro Met Tyr Cys 
210 215 220 

Leu Phe Glu He Gly Val Phe Phe Ser Arg Phe Tyr Val Gly Lys Gly 
225 230 235 240 

Arg Asn Arg Glu Glu Glu Asn Asp Ala Glu Ala Glu Ser Glu Lys Thr 
245 250 255 

Glu Glu 



<210> 14 
<211> 264 
<212> PRT 

<213> Escherichia coli 
<400> 14 

Met Glu Tyr Arg Met Phe Asp He Gly Val Asn Leu Thr Ser Ser Gin 
15 io i 5 

Phe Ala Lys Asp Arg Asp Asp Val Val Ala Arg Ala Phe Asp Ala Gly 
20 25 30 

Val Asn Gly Leu Leu He Thr Gly Thr Asn Leu Arg Glu Ser Gin Gin 
35 40 45 

Ala Gin Lys Leu Ala Arg Gin Tyr Ser Ser Cys Trp Ser Thr Ala Gly 
50 55 60 

Val His Pro His Asp Ser Ser Gin Trp Gin Ala Val Thr Glu Glu Ala 
65 70 75 80 

He lie Glu Leu Ala Ala Gin Pro Glu Val Val Ala He Gly Glu Cys 
85 so 95 

Gly Leu Asp Phe Asn Arg Asn Phe Ser Thr Pro Glu Glu Gin Glu Arg 
100 105 no 

Ala Phe Val Ala Gin Leu Arg He Ala Ala Glu Leu Asn Met Pro Val 
115 120 125 

Phe Met His Cys Arg Asp Ala His Glu Arg Phe Met Thr Leu Leu Glu 
130 135 140 

Pro Trp Leu Asp Lys Leu Pro Gly Ala Val Leu His Cys Phe Thr Gly 
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145 150 

Thr Arg Glu Glu Met Gin Ala Cys 
165 

lie Thr Gly Trp Val Cys Asp Glu 
180 

Leu Leu Pro Leu lie Pro Ala Glu 
195 200 

Pro Tyr Leu Leu Pro Arg Asp Leu 
210 215 

Asn Glu Pro Ala His Leu Pro His 
225 230 

Arg Gly Glu Asp Ala Ala Trp Leu 
245 

Lys Thr Leu Phe Gly lie Ala Phe 
260 



155 160 

Val Ala Cys Gly He Tyr He Gly 
170 175 

Arg Arg Gly Leu Glu Leu Arg Glu 
185 190 

Lys Leu Leu He Glu Thr Asp Ala 
205 

Thr Pro Lys Pro Ser Ser Arg Arg 
220 

lie Leu Gin Arg lie Ala His Trp 
235 240 

Ala Ala Thr Thr Asp Ala Asn Val 
250 255 



<210> 15 
<211> 586 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (170) (370) 

<400> 15 

tcttaaacaa ccgtcgcttt gcgccgccgc aattattatg atgttttttt actcggcgct 60 

tgattcacct tgttacagat tgctattgtg tgcgcgcgtc gaatgaccgt taatattctc 120 

tggtttttaa ggcgcgttct gttgccggtt atatgtcaag aaggtatct atg ggt gag 178 

Met Gly Glu 
1 

att agt att acc aaa ctg ctg gta gtt gcg gcg ctg gtc gtt ctg ctg 226 
He Ser He Thr Lys Leu Leu Val Val Ala Ala Leu Val Val Leu Leu 
5 10 15 

ttt ggg act aag aag tta cgt acg ctg ggc gga gac ctt gga gcg gcc 274 
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Phe Gly Thr Lys Lys Leu Arg Thr Leu Gly Gly Asp Leu Gly Ala Ala 
20 25 30 35 

att aaa ggg ttc aag aag gcg atg aat gat gac gat get gcg gcg aaa 322 
lie Lys Gly Phe Lys Lys Ala Met Asn Asp Asp Asp Ala Ala Ala Lys 
40 45 50 

aaa ggc gca gac gtt gat ctt cag get gaa aag etc tct cat aaa gag 370 
Lys Gly Ala Asp Val Asp Leu Gin Ala Glu Lys Leu Ser His Lys Glu 
55 60 65 

tgacgtggcg ageaggaege tccctcaata tcttgttcga tacaaaaacc cgcttcaaaa 430 

agcgggtttt ttatcagaca gatgtaagta attattacag gattacttaa cttccatccc 490 

tttcgcctgc aaatcggcgt ggtaagaaga geggacaaac ggaccgcatg cagcatgggt 550 

aaagcccatc gccagcgctt cgctttcatt tegteg 58 6 



<210> 16 
<211> 67 
<212> PRT 

<213> Escherichia coli 
<400> 16 

Met Gly Glu lie Ser lie Thr Lys Leu Leu Val Val Ala Ala Leu Val 
15 10 15 

Val Leu Leu Phe Gly Thr Lys Lys Leu Arg Thr Leu Gly Gly Asp Leu 
20 25 30 

Gly Ala Ala lie Lys Gly Phe Lys Lys Ala Met Asn Asp Asp Asp Ala 
35 40 45 

Ala Ala Lys Lys Gly Ala Asp Val Asp Leu Gin Ala Glu Lys Leu Ser 
50 55 60 

His Lys Glu 
65 



<210> 17 
<211> 4200 
<212> DNA 

<213> Salmonella typhimurium 
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<220> 
<221> CDS 

<222> (947) (1444) 

<220> 
<221> CDS 

<222> (1450) .. (1722) 
<400> 17 

cgcaagtcaa tgtcgtcccg gtcgtatgta aaagtatgtg aatagggcgg gcgaaagcgg 60 

ctaacaaaga ggcagcgtga aggataatgt gtataatgcg gccctaataa ttcatcatct 120 

atcacagagg aacatgtatg ggtggtatca gtatttggca gttgttgatt gttgccgtta 18 0 

tcgtcgtact gctgttcggc accaaaaaac tcggttccat cggttccgat cttggcgcgt 240 

ctatcaaagg ctttaaaaag gccatgagcg atgatgatgc caaacaggat aaaaccagtc 300 

aggacgctga ttttaccgct aaatctatcg cggataagca aggcgaagcg aaaaaggaag 360 

acgctaaaag ccaagataaa gagcaggtat aatccgtgtt tgatatcggt tttagcgaac 420 

tgctgttagt gttcgttatc ggcctcattg tgttggggcc gcaacgattg ccagtagcgg 480 

taaaaacggt agcgggctgg attcgcgcgt tgcggtccct tgcgacaacg gttcagaatg 540 

aactgactca ggaactgaaa cttcaggagt tccaggacag tctgaaaaaa gtcgaaaagg 600 

cgagcctgga aaatctgact cccgaactga aagcatctat ggatgaactg cgtcaggcgg 660 

cggagtcgat gaaacgcacc tacagcgcta acgatcccga acaagcgagc gatgaagcgc 720 

ataccatcca taatccggtg gtaaaaggga acgaaacgca gcatgagggc gtcacccctg 780 

ccgccgctga aacacaggcg agcgcgccgg aacaaaagcc ggagcccgtt aaagctaacg 840 

tgcctgagtc gacggaaacc gcttccgtag ccacgataga cgccgagaag aaatccgctg 900 

cgcctgttgt cgaatcttcc ccctcgtcga gtgataaacc gtaaac atg get gta 955 

Met Ala Val 
1 

gaa gat act caa ccg ctt ate acg cat ctg ate gag ttg cgt aag cgc 1003 
Glu Asp Thr Gin Pro Leu lie Thr His Leu lie Glu Leu Arg Lys Arg 
5 10 15 

ctg eta aac tgc ate gtc gca gta ctt ctg att ttt ctg gcg tta att 1051 
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Leu Leu Asn Cys He Val Ala Val Leu Leu He Phe Leu Ala Leu He 
20 25 30 35 

tat ttc gcc aat gat att tat cat tta gtc gcc gca ccg ctg att aaa 1099 
Tyr Phe Ala Asn Asp He Tyr His Leu Val Ala Ala Pro Leu He Lys 
40 45 50 

cag atg ccg caa ggg gcg aca atg att gcg acg gat gtg gcg teg ccg 1147 
Gin Met Pro Gin Gly Ala Thr Met He Ala Thr Asp Val Ala Ser Pro 



ttt ttt acg cct ate aaa etc acc ttc atg gtg tct ttg ate tta tec 1195 
Phe Phe Thr Pro He Lys Leu Thr Phe Met Val Ser Leu He Leu Ser 
70 75 80 

gcg cct gtc att ttg tac cag gtt tgg gcc ttt ate gcc ccg gcg ctg 1243 
Ala Pro Val He Leu Tyr Gin Val Trp Ala Phe He Ala Pro Ala Leu 
85 90 95 

tat aag cat gag cgt cgt ctg gtc gta cct ctg ctg gta tec age teg 1291 
Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val Ser Ser Ser 
100 105 no us 

ctg ctt ttc tat att ggt atg gcc ttc gcc tat ttt gtc gta ttc cct 1339 
Leu Leu Phe Tyr He Gly Met Ala Phe Ala Tyr Phe Val Val Phe Pro 
120 125 130 

ttg gcc ttt ggt ttc ctg acg cat acg gcg ccg gaa ggg gta cag gtt 1387 
Leu Ala Phe Gly Phe Leu Thr His Thr Ala Pro Glu Gly Val Gin Val 
135 140 145 

teg aca gat ate gcc age tat ctt age ttt gtc atg gcg ctt ttt atg 1435 
Ser Thr Asp He Ala Ser Tyr Leu Ser Phe Val Met Ala Leu Phe Met 
150 155 160 

gcc ttt gcg tagee ttt gaa gtg ccg gtg gcg att gtg ttg ctg tgc tgg 1485 
Ala Phe Ala Phe Glu Val Pro Val Ala He Val Leu Leu Cys Trp 

165 170 175 

atg ggc ate acc acg cca gaa gat ttg cgt aaa aaa egg cct tat ate 1533 
Met Gly He Thr Thr Pro Glu Asp Leu Arg Lys Lys Arg Pro Tyr He 
180 185 190 

ctg gtc ggg gca ttc att gtg gga atg ctg ctt acg ccg cca gat gtt 1581 
Leu Val Gly Ala Phe He Val Gly Met Leu Leu Thr Pro Pro Asp Val 
195 200 205 210 

ttc teg caa acg ttg ctg gcg ata ccg atg tac tgc ctg ttt gaa att 1629 
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Phe Ser Gin Thr Leu Leu Ala lie Pro Met Tyr Cys Leu Phe Glu lie 
215 220 225 

ggc gtt ttc tgc tea cgc ttt tat gtc ggt aag cga egg acg cgc gac 1677 
Gly Val Phe Cys Ser Arg Phe Tyr Val Gly Lys Arg Arg Thr Arg Asp 
230 235 240 

gaa gat aac gag gec gaa acc gaa aag gec gag cac act gaa gac 1722 
Glu Asp Asn Glu Ala Glu Thr Glu Lys Ala Glu His Thr Glu Asp 
245 250 255 

taaacacaac cgcccgccag ggcggttgtc atatgggggc aagcatgttt gatattggcg 1782 

ttaatttaac cagtagecag tttgcaaaag atcgtgatga tgtggtcgcc cgtgcgtttg 1842 

eggegggagt aaaaggtatg ctactgaccg gaacgaacat ccatgaaagt cagcaggegt 1902 

taaaactggc gcggcgctac ccccattgtt ggtcgaegge tggegtccat ccccatgaca 1962 

gcagtcagtg gtcacccgcg tctgaagacg ecattattge getggegaac cagceggaag 2022 

tegtegctat cggtgagtgc gggctggatt teaategcaa tttttccacg ccgcaggagc 2082 

aggagcgtgc etttcaggeg cagctacaaa ttgccgccga attgeagata ccaatcttta 2142 

tgcactgccg ggacgegcat gagegattte tggtattget tgatccctgg ctggatagtc 2202 

ttcctggtgc aatactgeae tgctttaccg gttcacgcca geaaatgeag gcctgtgtgg 2262 

atagagggct etatateggt attacegggt gggtttgcga cgaacgacgc gggcttgagc 2322 

tacgtgaact cttaccgttt attccagegg aaaagctact gatagaaacc gacgcgccgt 2382 

atctgttgcc tegegatett acgccgaaac caacgtcacg aegcaacgag cccgcgtatc 2442 

tgcctcacat cctggagcgc atagegctat ggcgtggtga agatcegcaa tggttagcgg 2502 

cgatgacaga tgccaacgcc agaaccttat ttgaggttgt attctgaacg ategctaaat 2562 

ettgegaaaa ccggtgtttt ttacgctctg cttcacttct ttattgagta aattaagcag 2622 

taacatcgaa cgcgtttcgc catccggttc ggtaaaaatc gctttcagcc ettcaaatge 2682 

gccttccgtg atgatgaege tategceggg atagggggtt tcaggatcga caacgccttc 2742 

gggcttgtag atagaaagct gatgaataac gctggaaggc aegategcag gatgccgcca 2802 

aagcgcacaa aatggctgac gccgcgcgtg gcgttgattg tagtggtatg tatcacttcc 2862 
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ggatcaaatt caacgaacag ataattagga 
ccgcgtacca ttttttccag ggtgatcatc 
aggtgttcct gagcacgctg aagttgcccg 
ataatgactc ttatccgctt gttcggggcg 
attatacact tcatccttta agccgtatct 
aatttaacaa aaatacagca tcccgatgat 
ggctacaatg gacgccatga aatatcacga 
gcagggggaa ctaaaacgca tcacgctacc 
cgctgaccgc acgctgcgtg ccggtggacc 
cgccatgccg gtgctgtgca acctttttgg 
gcaggatgat gtttccgcct tacgggaagt 
tgagccgccg aaagcgtttc gcgatctgtt 
gaatatgccg acgaaacggt tacgcggcgc 
tgatgtcgat ttaacgcgtc ttcctgtcat 
gattacctgg ggactgacgg taacgcgtgg 
ttatcgtcag cagttgatag gtaaaaataa 
cggcgcgctg gattttcagg agtggttagc 
cgtcgcattg ggcgccgatc cggcacgata 
ctgtcggagt atgcctttgc gggcctgctg 
ttctacgatc tggagtgctg cagcgcgaga 
cggagacgta tgcgatcata cggcatatat 
acagcgtaga tgcatctata tcactatacg 
ctcgtctatc tcaaagtc 
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aagagcggtt cgctgacgga ggtacgtttt 2922 
ggtgtcaggc aacttaccgc ttgtctttcg 2982 
cgtttgcagt acagtaaata ccaggattgc 3042 
caagcatagc aaaagccatg cgcgaagtta 3102 
ggattagcgt tggttgccag agttcacgct 3162 
gaacgccgta taatgatgcg cttaccaaga 3222 
tttacgcgac ttcctgacgc tacttgagca 3282 
tgtggatcct catctggaaa tcacggaaat 3342 
ggcgttgctg tttgaaaatc ctaaaggtta 3402 
cacgccaaaa cgcgtggcga tgggcatggg 3462 
gggtaaatta ttagcgtttc ttaaagaacc 3522 
tgacaagctg ccgcagttta agcaagtgct 3582 
gccttgccag cagaaaatcg cgtctggcga 3642 
gacctgttgg ccggacgacg ccgcgccgct 3702 
tccgcacaaa gagcggcaaa acctgggcat 3762 
gctgattatg cgctggctgt ctcaccgcgg 3822 
cgcgcgtccg ggtgaacgtt tcccggtctc 3882 
cttggcgccg tgactcctgt tcccgatact 3942 
cgcggcacga aaactgaagt ggttaatgct 4002 
tatcttgaag tacatgagcg gagagatgcg 4062 
gagtgatagc tcgtcttacg tcacgcaata 4122 
cgcgcatgag ctcgtatagg tgcctcatat 4182 
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<210> 18 
<211> 166 
<212> PRT 

<213> Salmonella typhimurium 
<400> 18 

Met Ala Val Glu Asp Thr Gin Pro Leu He Thr His Leu He Glu Leu 
15 10 15 

Arg Lys Arg Leu Leu Asn Cys He Val Ala Val Leu Leu He Phe Leu 
20 25 30 

Ala Leu He Tyr Phe Ala Asn Asp He Tyr His Leu Val Ala Ala Pro 
35 40 45 

Leu He Lys Gin Met Pro Gin Gly Ala Thr Met He Ala Thr Asp Val 
50 55 60 

Ala Ser Pro Phe Phe Thr Pro He Lys Leu Thr Phe Met Val Ser Leu 
65 70 75 80 

He Leu Ser Ala Pro Val He Leu Tyr Gin Val Trp Ala Phe He Ala 
85 90 95 

Pro Ala Leu Tyr Lys His Glu Arg Arg Leu Val Val Pro Leu Leu Val 
100 105 110 

Ser Ser Ser Leu Leu Phe Tyr He Gly Met Ala Phe Ala Tyr Phe Val 
115 120 125 

Val Phe Pro Leu Ala Phe Gly Phe Leu Thr His Thr Ala Pro Glu Gly 
130 135 140 

Val Gin Val Ser Thr Asp He Ala Ser Tyr Leu Ser Phe Val Met Ala 
145 150 155 160 

Leu Phe Met Ala Phe Ala 
165 



<210> 19 
<211> 91 
<212> PRT 

<213> Salmonella typhimurium 
<400> 19 

Phe Glu Val Pro Val Ala He Val Leu Leu Cys Trp Met Gly He Thr 
15 10 15 
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Thr Pro Glu Asp Leu Arg Lys Lys Arg Pro Tyr lie Leu Val Gly Ala 
20 25 30 

Phe lie Val Gly Met Leu Leu Thr Pro Pro Asp Val Phe Ser Gin Thr 
35 40 45 

Leu Leu Ala lie Pro Met Tyr Cys Leu Phe Glu lie Gly Val Phe Cys 
50 55 60 

Ser Arg Phe Tyr Val Gly Lys Arg Arg Thr Arg Asp Glu Asp Asn Glu 
65 70 75 80 

Ala Glu Thr Glu Lys Ala Glu His Thr Glu Asp 
85 90 



<210> 20 
<211> 2601 
<212> DNA 

<213> Neisseria meningitidis 

<220> 
<221> CDS 

<222> (1572) (2339) 
<400> 20 

agacaaaatc ctaaaaaaag tgattgaaga ggcgggcgaa gtgttgatgg catccaaaga 60 
caaaaacccg tcccacctgg tttacgaagt tgccgactta tggtttcaca ccatgattct 120 
tctgacacac cacgacctga aggcggaaga cgtattggac gaacttgcgc gccgccaagg 180 
tttgtcgggc ttggccgaaa aagccgctcg cacagaatct tgaatttata ttaaaatccg 240 
cactttccca cattcaatcc gtctgaccgc tgttcagacg gcatcggagc cgttatggac 300 
aactgtattt tctgcaaaat cgccgccaaa gacattccgg cgcaaaccgt ctatgaagac 360 
ggcgaaatgg tttgtttcaa agacatcaac cccgctgctc cggttcatct gctgctgatt 420 
cccaaagtcc atttcgattc gttggcacac gccgcgcccg aacatcagcc ccttttggga 480 
aaaatgatgc tgaaagttcc cgaaatcgcc aaagcggcag gactggcaga cggcttcaaa 540 
accctgatca acaccggaaa aggcggcgga caagaggtct tccacctgca tatacacatc 600 
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atgggcacac ccgtataaac cgttatttca caatcaaccc ctaatactta cttaaggata 660 

catcatgggc agtttttctc tgacgcactg gattatcgta ctgattatcg tcgttttgat 720 

attcggcacc aaaaaactgc gcaacgtcgg caaagacctc ggcggtgcgg ttcatgactt 780 

caaacagggg ctgaacgaag gtacagacgg caaagaagcc caaaaagacg atgtaatcga 840 

acacaaaaaa gacgaagaca aagcgtaatt tatgtttgat ttcggtttgg gcgagctggt 900 

ttttgtcggc attatcgccc tgattgtcct cggccccgaa cgcctgcccg aggccgcccg 960 

caccgccgga cggctcatcg gcaggctgca acgctttgtc ggcagcgtca aacaggaatt 1020 

tgacacgcaa atcgaactgg aagaactaag gaaggcaaag caggaatttg aagctgccgc 1080 

tgctcaggtt cgagacagcc tcaaagaaac cggtacggat atggagggta atctgcacga 1140 

catttccgac ggtctgaagc cttgggaaaa actgcccgaa cagcgcacgc ctgctgattt 1200 

cggtgtcgat gaaaacggca atccctttcc cgatgcggca aacaccctat tagacggcat 1260 

ttccgacgtt atgccgtccg aacgttccta cgcttccgcc gaaacccttg gggacagcgg 1320 

gcaaaccggc agtacagccg aacccgcgga aaccgaccaa gaccgtgcat ggcgggaata 1380 

cctgactgct tctgccgccg cacccgtcgt acagaccgtc gaagtcagct atatcgatac 1440 

cgctgttgaa acccctgttc cgcataccac ttcgctgcgt aaacaggcaa taagccgcaa 1500 

acgcgatttg cgtcctaaat cccgcgccaa acctaaattg cgcgtccgta aatcataaag 1560 

agggcaatcc g gtg tec gaa aca caa aac gaa caa ccc gtc caa ccg ctt 1610 
Val Ser Glu Thr Gin Asn Glu Gin Pro Val Gin Pro Leu 
15 10 

gtc gag cat etc ate gag ctg cgc cgc cgc ctg atg tgg acg gtt gtc 1658 
Val Glu His Leu lie Glu Leu Arg Arg Arg Leu Met Trp Thr Val Val 
15 20 25 

ggt ate tta gtc tgc ttt ttc ggc eta atg ccg ttt gee caa caa etc 1706 
Gly He Leu Val Cys Phe Phe Gly Leu Met Pro Phe Ala Gin Gin Leu 
30 35 40 45 

tat act ttt ate gee gac cog ctg atg gca aac ctg ccc aaa gac acc 1754 
Tyr Thr Phe He Ala Asp Pro Leu Met Ala Asn Leu Pro Lys Asp Thr 
50 55 60 
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age atg att gec acc gat gtc ate gca cca ttt ttc gtg ccg gtc aaa 
Ser Met He Ala Thr Asp Val He Ala Pro Phe Phe Val Pro Val Lys 



gtt acc ctg atg gcg gca ttt tta att teg ctg ccg cat acg etc tac 
Val Thr Leu Met Ala Ala Phe Leu He Ser Leu Pro His Thr Leu Tyr 



caa ate tgg gca ttc gtc gec ccc gca etc tac caa aac gaa aaa cgc 
Gin lie Trp Ala Phe Val Ala Pro Ala Leu Tyr Gin Asn Glu Lys Arg 
95 100 105 

ctg att acg ccg etc gtc etc tec age gtc age ctg ttt ttc ate ggc 
Leu He Thr Pro Leu Val Leu Ser Ser Val Ser Leu Phe Phe He Gly 
110 115 120 125 

atg gca ttt gec tac ttt ttg gtt ttc ccc gtc att ttc aaa ttc ctt 
Met Ala Phe Ala Tyr Phe Leu Val Phe Pro Val He Phe Lys Phe Leu 
130 135 140 

gec age gtt acc cct gtc ggt gtc aat atg gcg aca gac ate gac aaa 
Ala Ser Val Thr Pro Val Gly Val Asn Met Ala Thr Asp He Asp Lys 
145 150 155 

tac etc tec ttc ate ttg ggg atg ttt gtc gca ttc ggt aca acg ttt 
Tyr Leu Ser Phe He Leu Gly Met Phe Val Ala Phe Gly Thr Thr Phe 
160 165 170 

gaa gtc ccc att gtc gtt ate ctg tta acc aaa att ggt gtg gta aca 
Glu Val Pro He Val Val He Leu Leu Thr Lys He Gly Val Val Thr 
175 180 185 

acc gaa cag etc aaa cgc gee cgc ccc tat gtg att gtc ggc gcg ttt 
Thr Glu Gin Leu Lys Arg Ala Arg Pro Tyr Val He Val Gly Ala Phe 
190 195 200 205 

gtc att gec gec ate ate acg ccg ccc gat gtg att tea caa acc ctg 
Val He Ala Ala He He Thr Pro Pro Asp Val He Ser Gin Thr Leu 
210 215 220 

ctt gee att ccg ctg att etc tta tac gaa gca ggt att tgg ttc gga 
Leu Ala He Pro Leu He Leu Leu Tyr Glu Ala Gly He Trp Phe Gly 
225 " 230 235 

cgc ttt ttc acg cca cgt tea gaa cag gat ggc gac ata cag ccg cct 
Arg Phe Phe Thr Pro Arg Ser Glu Gin Asp Gly Asp He Gin Pro Pro 
240 245 250 
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gca aca acc tgacactatg ccgtccgaac ctccgcctca taccgccaca 2379 
Ala Thr Thr 
255 

gattaaggaa tacctttgaa taccctctat ttaggttcaa acagcccgcg ccgaatggaa 2439 

atcctgacac agttgggcta tcaggtcgtc aagctgcctg ccaacatcga cgaaacggtc 2499 

agacagaacg aagaccctgc ccgttacgtt caaaggatgg cagaagaaaa aaaccgaacc 2559 
gccctgaccc tcttttgcga aaccaacggc acaatgcccg at 2601 



<210> 21 
<211> 256 
<212> PRT 

<213> Neisseria meningitidis 
<40O> 21 

Val Ser Glu Thr Gin Asn Glu Gin Pro Val Gin Pro Leu Val Glu His 
15 10 15 

Leu lie Glu Leu Arg Arg Arg Leu Met Trp Thr Val Val Gly lie Leu 
20 25 30 

Val cys Phe Phe Gly Leu Met Pro Phe Ala Gin Gin Leu Tyr Thr Phe 



He Ala Asp Pro Leu Met Ala Asn Leu Pro Lys Asp Thr Ser Met He 
50 55 60 

Ala Thr Asp Val He Ala Pro Phe Phe Val Pro Val Lys Val Thr Leu 
65 70 75 80 

Met Ala Ala Phe Leu He Ser Leu Pro His Thr Leu Tyr Gin He Trp 
85 90 95 

Ala Phe Val Ala Pro Ala Leu Tyr Gin Asn Glu Lys Arg Leu He Thr 
100 105 110 

Pro Leu Val Leu Ser Ser Val Ser Leu Phe Phe He Gly Met Ala Phe 
115 120 125 

Ala Tyr Phe Leu Val Phe Pro Val He Phe Lys Phe Leu Ala Ser Val 
130 135 140 

Thr Pro Val Gly Val Asn Met Ala Thr Asp lie Asp Lys Tyr Leu Ser 
145 150 155 160 
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Phe lie Leu Gly Met Phe Val Ala Phe Gly Thr Thr Phe Glu Val Pro 
165 170 175 

lie Val Val He Leu Leu Thr Lys He Gly Val Val Thr Thr Glu Gin 
180 185 190 

Leu Lys Arg Ala Arg Pro Tyr Val He Val Gly Ala Phe Val He Ala 
195 200 205 

Ala He He Thr Pro Pro Asp Val He Ser Gin Thr Leu Leu Ala He 
210 215 220 

Pro Leu He Leu Leu Tyr Glu Ala Gly He Trp Phe Gly Arg Phe Phe 
225 230 235 240 

Thr Pro Arg Ser Glu Gin Asp Gly Asp He Gin Pro Pro Ala Thr Thr 
245 250 255 



<210> 22 
<211> 4604 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (2982) . . (4082) 

<220> 
<221> CDS 

<222> (1534) .. (2637) 

<220> 
<221> CDS 

<222> (749) (1531) 

<220> 

<221> CDS 

<222> (6).. (746) 

<400> 22 

ggcta gtt gat gat aat ttg aaa ggt caa ggt gca gga aaa aat ttt tta 50 

Val Asp Asp Asn Leu Lys Gly Gin Gly Ala Gly Lys Asn Phe Leu 
15 10 15 

teg ctg ata aag tac age gag aca gat tat aca att tat tgt gac caa 98 
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er Leu lie Lys Tyr Ser Glu Thr Asp Tyr Thr lie Tyr Cys Asp Gin 



gat gat att tgg tta gaa aac aaa ata ttt gaa tta gta aag tat gca 

Asp Asp lie Trp Leu Glu Asn Lys He Phe Glu Leu Val Lys Tyr Ala 

35 40 45 

aat gaa att aaa ttg aat gta tea gat gcg cct teg eta gtt tat get 

Asn Glu He Lys Leu Asn Val Ser Asp Ala Pro Ser Leu Val Tyr Ala 

50 55 60 

gat ggc tat get tat atg gat ggt gag ggt aca ate gat ttt tct ggg 

Asp Gly Tyr Ala Tyr Met Asp Gly Glu Gly Thr He Asp Phe Ser Gly 



ata tct aac aat cat get gat caa tta aag gat ttt ctt ttt ttt aat 
He Ser Asn Asn His Ala Asp Gin Leu Lys Asp Phe Leu Phe Phe Asn 
80 85 90 95 

ggt gga tac caa gga tgt tct att atg ttc aat cgt gca atg acc aaa 
Gly Gly Tyr Gin Gly Cys Ser He Met Phe Asn Arg Ala Met Thr Lys 
100 105 no 

ttt ctt ctg aat tat cga gga ttt gta tat eta cat gac gat ate aca 
Phe Leu Leu Asn Tyr Arg Gly Phe Val Tyr Leu His Asp Asp He Thr 
115 120 125 

aca tta get gca tac get ctt ggt aaa gtt tat ttt etc ccg aaa tac 
Thr Leu Ala Ala Tyr Ala Leu Gly Lys Val Tyr Phe Leu Pro Lys Tyr 
130 135 140 

ctt atg tta tat aga cag cac acg aat gcg gta act ggt ate aaa aca 
Leu Met Leu Tyr Arg Gin His Thr Asn Ala Val Thr Gly He Lys Thr 
145 150 155 

ttc cgc aat gga ttg act tct aaa ttt aaa tea cca gta aac tat ctt 
Phe Arg Asn Gly Leu Thr Ser Lys Phe Lys Ser Pro Val Asn Tyr Leu 
160 165 170 175 

tta tea cga aaa cat tat cag gta aaa aaa tct ttt ttt gaa tgt aac 
Leu Ser Arg Lys His Tyr Gin Val Lys Lys Ser Phe Phe Glu Cys Asn 
180 185 190 

age tct ate tta tea gag acg aat aaa aaa gtt ttt ttg gat ttt att 
Ser Ser He Leu Ser Glu Thr Asn Lys Lys Val Phe Leu Asp Phe He 
195 200 205 

tea ttt tgt gaa tea aat aat aaa ttt aca gat ttt ttt aag tta tgg 
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Ser Phe Cys Glu Ser Asn Asn Lys Phe Thr Asp Phe Phe Lys Leu Trp 
210 215 220 

cga ggt ggg ttt aga tta aat aac agt aga act aaa tta tta tta aaa 722 
Arg Gly Gly Phe Arg Leu Asn Asn Ser Arg Thr Lys Leu Leu Leu Lys 
225 230 235 

ttc tta ata egg aga aaa ttt age ga atg att tea ata ctt aca cct 7 69 
Phe Leu He Arg Arg Lys Phe Ser Met He Ser He Leu Thr Pro 
240 245 250 

act ttt aat egg caa cat act tta tea agg eta ttc aat tct ctt ata 817 
Thr Phe Asn Arg Gin His Thr Leu Ser Arg Leu Phe Asn Ser Leu He 
255 260 265 270 

tta caa act gat aaa gat ttt gag tgg ata ata att gat gat ggt agt 865 
Leu Gin Thr Asp Lys Asp Phe Glu Trp He He He Asp Asp Gly Ser 
275 280 285 

ata gat gca aca gcg gta ctt gta gaa gat ttt aga aaa aaa tgt gat 913 
He Asp Ala Thr Ala Val Leu Val Glu Asp Phe Arg Lys Lys Cys Asp 
290 295 300 

ttt gac ttg att tat tgc tat cag gaa aat aat ggt aag ccc atg get 961 
Phe Asp Leu He Tyr Cys Tyr Gin Glu Asn Asn Gly Lys Pro Met Ala 
305 310 315 

tta aac get ggt gtt aaa get tgt aga ggc gat tat ate ttt att gtt 1009 
Leu Asn Ala Gly Val Lys Ala Cys Arg Gly Asp Tyr He Phe He Val 
320 325 330 

gac agt gat gat gca eta act ccc gat gec ata aaa tta att aaa gaa 1057 
Asp Ser Asp Asp Ala Leu Thr Pro Asp Ala He Lys Leu He Lys Glu 
335 340 345 350 

tea ata cat gat tgc tta tct gag aag gaa agt ttc age gga gtc ggt 1105 
Ser He His Asp Cys Leu Ser Glu Lys Glu Ser Phe Ser Gly Val Gly 
355 360 r 365 

ttt aga aaa gca tat ata aaa ggg ggg att att ggt aat gat tta aat 1153 
Phe Arg Lys Ala Tyr He Lys Gly Gly He He Gly Asn Asp Leu Asn 
370 375 380 

aat tct tea gaa cat ata tac tat tta aat gcg act gag att age aat 1201 
Asn ser Ser Glu His He Tyr Tyr Leu Asn Ala Thr Glu He Ser Asn 
385 390 395 

tta ata aat ggt gat gtt gca tat tgt ttt aaa aaa gaa agt ttg gta 1249 



WO 00/28038 



PCT/GB99/03721 



Leu lie Asn Gly Asp Val Ala Tyr Cys Phe Lys Lys Glu Ser Leu Val 
400 405 410 

aaa aat cca ttc ccc cgt ata gaa gat gaa aaa ttt gtt cca gaa tta 1297 

Lys Asn Pro Phe Pro Arg lie Glu Asp Glu Lys Phe Val Pro Glu Leu 
415 420 425 430 

tat att tgg aat aaa ata act gac aag gcg aag att cga ttt aac ata 1345 

Tyr lie Trp Asn Lys He Thr Asp Lys Ala Lys He Arg Phe Asn He 
435 440 445 

age aaa gtt ata tat ctt tgt gag tat ctt gat gat ggt ctt tct aaa 1393 

Ser Lys Val He Tyr Leu Cys Glu Tyr Leu Asp Asp Gly Leu Ser Lys 

450 455 460 

aat ttc cat aac cag ctt aaa aaa tac cca aag ggg ttt aag att tat 1441 

Asn Phe His Asn Gin Leu Lys Lys Tyr Pro Lys Gly Phe Lys He Tyr 
465 470 475 

tac aaa gat caa aga aaa cga gag aaa act tat ata aaa aaa aca aag 1489 

Tyr Lys Asp Gin Arg Lys Arg Glu Lys Thr Tyr He Lys Lys Thr Lys 
480 485 490 

atg eta att aga tat ttg caa tgt tgt tat tat gag aaa ata aa atg 1536 

Met Leu He Arg Tyr Leu Gin Cys Cys Tyr Tyr Glu Lys He Met 
495 500 505 

aaa ata eta ttt gtc att aca ggt tta ggc ctt gga ggt get gag aag 1584 

Lys He Leu Phe Val He Thr Gly Leu Gly Leu Gly Gly Ala Glu Lys 
510 515 520 525 

cag gtt tgt ctt tta get gat aaa tta agt tta age ggg cac cat gta 1632 

Gin Val Cys Leu Leu Ala Asp Lys Leu Ser Leu Ser Gly His His Val 
530 535 540 

aag att att tea ctt gga cat atg tct aat aat aaa gtc ttt cct age 1680 

Lys He He Ser Leu Gly His Met Ser Asn Asn Lys Val Phe Pro Ser 

545 550 555 

gaa aat aat gtt aat gtc att aat gta aat atg tea aaa aac att tct 1728 

Glu Asn Asn Val Asn Val He Asn Val Asn Met Ser Lys Asn He Ser 
560 565 570 

gga gtt ata aaa ggt tgt gtc aga att aga gat gtt ata get aat ttc 1776 

Gly Val He Lys Gly Cys Val Arg He Arg Asp Val He Ala Asn Phe 
575 580 585 

aaa cca gac att gta cac agt cat atg ttt cat gca aac att ate act 1824 
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Lys Pro Asp He Val His Ser His Met Phe His Ala Asn He He Thr 
590 595 600 605 

aga ttg tct gta att gga ate aaa aac aga cct ggt att ata tea act 
Arg Leu Ser Val He Gly He Lys Asn Arg Pro Gly He He Ser Thr 
610 615 620 

gca cat aat aaa aat gaa ggt ggg tat ttc aga atg etc aca tat aga 
Ala His Asn Lys Asn Glu Gly Gly Tyr Phe Arg Met Leu Thr Tyr Arg 
625 630 635 

ata acc gat tgt tta agt gat tgt tgt aca aat gtt age aaa gaa gca 
He Thr Asp Cys Leu Ser Asp Cys Cys Thr Asn Val Ser Lys Glu Ala 
640 645 650 

gtg gat gag ttt. tta egg ata aaa gec ttt aat ccc get aaa gca att 
Val Asp Glu Phe Leu Arg He Lys Ala Phe Asn Pro Ala Lys Ala He 
655 660 665 

act atg tat aat ggg ata gat acc aat aaa ttt aaa ttt gat tta ttg 
Thr Met Tyr Asn Gly He Asp Thr Asn Lys Phe Lys Phe Asp Leu Leu 
670 675 680 685 

gca agg agg gaa att cga gac ggt att aat ata aaa aat gat gat ata 
Ala Arg Arg Glu He Arg Asp Gly He Asn He Lys Asn Asp Asp He 
690 695 700 

tta tta ctt get gca ggt cgt tta acg tta get aaa gat tat cct aat 
Leu Leu Leu Ala Ala Gly Arg Leu Thr Leu Ala Lys Asp Tyr Pro Asn 
705 710 715 

tta ttg aat gca atg act ctg ctt cct gaa cac ttt aaa ctt att att 
Leu Leu Asn Ala Met Thr Leu Leu Pro Glu His Phe Lys Leu He He 
720 725 730 

att ggt gat ggt gaa ttg cgt gac gaa att aat atg ctt ata aaa aaa 
He Gly Asp Gly Glu Leu Arg Asp Glu He Asn Met Leu He Lys Lys 
735 740 745 

ttg caa tta tct aat agg gtg tec ttg ttg gga gtt aaa aaa aat att 
Leu Gin Leu Ser Asn Arg Val Ser Leu Leu Gly Val Lys Lys Asn He 
750 755 760 765 

get ccc tat ttt tct gca tgt gat att ttt gtt etc tct tct cgt tgg 
Ala Pro Tyr Phe Ser Ala Cys Asp He Phe Val Leu Ser Ser Arg Trp 
770 775 780 

gaa gga ttt gga tta gtc gtg gca gaa get atg tea tgt gag cga att 
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Glu Gly Phe Gly Leu Val Val Ala Glu Ala Met Ser Cys Glu Arg He 
785 790 795 

gtt gtt ggc acg gat tea ggg gga gta aga gaa gtt att ggt gac gat 2448 
Val Val Gly Thr Asp Ser Gly Gly Val Arg Glu Val He Gly Asp Asp 
800 805 810 

gat ttt ctt gta ccc ata tct gat tea aca caa ctt gca age aaa att 2496 
Asp Phe Leu Val Pro He Ser Asp Ser Thr Gin Leu Ala Ser Lys He 
815 820 825 

gaa aaa ttg tct ttg age cag ata cgt gat cac att ggt ttt egg aat 2544 
Glu Lys Leu Ser Leu Ser Gin He Arg Asp His He Gly Phe Arg Asn 
830 835 840 8 45 

cgt gag cgt att tta aaa aat ttc tea ata gat act att att atg cag 2592 
Arg Glu Arg He Leu Lys Asn Phe Ser He Asp Thr He He Met Gin 
850 855 860 

tgg caa gaa etc tat gga act ata att tgc tea aaa cat gaa agg 2637 
Trp Gin Glu Leu Tyr Gly Thr He He Cys Ser Lys His Glu Arg 
865 870 875 

tagatttata tttggaacgt gtcttttgtt tgaatttaat tcaatctcaa ttgagatttt 2697 

tgtatttcaa aaataccatc atagctaacg atgattggta tttattttaa gatgetttet 2757 

ataaatatat tgacgttttt aatgegcega aacgattggg ctgggaacag agaagtaaaa 2817 

ctgttttgag aatgaagagt ttttgagatg tttatggata ttaaaaattg atccagtgaa 2877 

ttaattattt ataataaatc aagatttaat gttaataaat gataatcttt tctgacactc 2937 

atattaatta tgagtggtac gtttggtaaa eggtaaacta ttat atg aca get aga 2993 

Met Thr Ala Arg 
880 

aca act aaa gtt ttg cac tta caa tta etc cca etc tta agt ggc gtt 3041 
Thr Thr Lys Val Leu His Leu Gin Leu Leu Pro Leu Leu Ser Gly Val 
885 890 895 

caa agg gta aca tta aac gaa att agt gcg tta tat act gat tat gat 3089 
Gin Arg Val Thr Leu Asn Glu He Ser Ala Leu Tyr Thr Asp Tyr Asp 
900 905 910 

tat aca eta gtt tgc tea aaa aaa ggt cca eta aca aaa gca ttg ctg 3137 
Tyr Thr Leu Val Cys Ser Lys Lys Gly Pro Leu Thr Lys Ala Leu Leu 
915 920 925 
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gaa tat gat gtc gat tgt cat tgt ate ccc gaa ctt acg aga gaa att 3185 
Glu Tyr Asp Val Asp Cys His Cys lie Pro Glu Leu Thr Arg Glu lie 
530 935 940 

acc gta aag aat gat ttt aaa gca ttg ttc aag ctt tat aag ttc ata 3233 
Thr Val Lys Asn Asp Phe Lys Ala Leu Phe Lys Leu Tyr Lys Phe He 
945 950 955 960 

aaa aaa gaa aaa ttt gac att gtg cat aca cat tct tea aaa aca ggt 3281 
Lys Lys Glu Lys Phe Asp He Val His Thr His Ser Ser Lys Thr Gly 
965 970 975 

att ttg ggg cga gtt get gec aaa tta gca cgt gtt gga aag gtg ate 3329 
He Leu Gly Arg Val Ala Ala Lys Leu Ala Arg Val Gly Lys Val He 
980 985 990 

cac act gta cat ggt ttt tct ttt cca gec gca tct agt aaa aaa agt 3377 
His Thr Val His Gly Phe Ser Phe Pro Ala Ala Ser Ser Lys Lys Ser 
995 1000 1005 

tat tac ctt tat ttt ttc atg gaa tgg ata gca aag ttc ttt acg gat 3425 
Tyr Tyr Leu Tyr Phe Phe Met Glu Trp He Ala Lys Phe Phe Thr Asp 
1010 1015 1020 

aag tta ate gtc ttg aat gta gat gat gaa tat ata gca ata aac aaa 3473 
Lys Leu He Val Leu Asn Val Asp Asp Glu Tyr He Ala He Asn Lys 
1025 1030 1035 1040 

tta aaa ttc aag egg gat aaa gtt ttt tta att cct aat gga gta gac 3521 
Leu Lys Phe Lys Arg Asp Lys Val Phe Leu He Pro Asn Gly Val Asp 
1045 1050 1055 

act gat aag ttt tct cct tta gaa aat aaa att tat agt age acc ttg 3569 
Thr Asp Lys Phe Ser Pro Leu Glu Asn Lys He Tyr Ser Ser Thr Leu 
1060 1065 1070 

aat eta gta atg gtt ggt aga tta tec aag caa aaa gat cct gag aca 3617 
Asn Leu Val Met Val Gly Arg Leu Ser Lys Gin Lys Asp Pro Glu Thr 
1075 1080 1085 

tta ttg ctt get gtt gaa aaa ctg ctg aat gaa aat gtt aat gtt aag 3665 
Leu Leu Leu Ala Val Glu Lys Leu Leu Asn Glu Asn Val Asn Val Lys 
1090 1095 HOO 

ctg aca ctt gta gga gat ggt gaa eta aaa gaa cag tta gaa age agg 3713 
Leu Thr Leu Val Gly Asp Gly Glu Leu Lys Glu Gin Leu Glu Ser Arg 
1105 1110 ins 1120 
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ttc aaa egg caa gat gga cgt ata att ttt cat gga tgg tea gat aac 3761 
Phe Lys Arg Gin Asp Gly Arg lie lie Phe His Gly Trp Ser Asp Asn 
1125 1130 1135 

att gtt aat att tta aaa gtt aat gat ctt ttt ata tta cct tct ctt 3809 
lie Val Asn lie Leu Lys Val Asn Asp Leu Phe lie Leu Pro Ser Leu 
1140 1145 1150 

tgg gag ggt atg cca tta gca att tta gaa gca ttg age tgt gga ctt 3857 
Trp Glu Gly Met Pro Leu Ala lie Leu Glu Ala Leu Ser Cys Gly Leu 
1155 1160 1165 

cca tgt ata gtc act aat att cca ggt aat aat age tta ata gaa gat 3905 
Pro Cys lie Val Thr Asn lie Pro Gly Asn Asn Ser Leu lie Glu Asp 
1170 1175 1180 

ggc tat aat ggt tgt ttg ttt gaa att aga gat tgt cag tta tta tct 3953 
Gly Tyr Asn Gly Cys Leu Phe Glu lie Arg Asp Cys Gin Leu Leu Ser 
1185 1190 1195 1200 

caa aaa ate atg tea tat gtt ggt aag cca gaa ctg att gca cag caa 4001 
Gin Lys He Met Ser Tyr Val Gly Lys Pro Glu Leu He Ala Gin Gin 
1205 1210 1215 

tct ace aat gca cga tea ttt att ctg aaa aat tat gga tta gtt aaa 4049 
Ser Thr Asn Ala Arg Ser Phe He Leu Lys Asn Tyr Gly Leu Val Lys 
1220 1225 1230 

aga aat aat aag gtc aga cag eta tat gat aat taaatgaaac cgaaaagtta 4102 
Arg Asn Asn Lys Val Arg Gin Leu Tyr Asp Asn 
1235 1240 

aaaaagaaca ggtttttcaa agtgaaaata aaattacagt ttttttattg caatgattaa 4162 

egtaacatet gcattacatt caagccgcac aaccccgcgg tgaccacccc tgacaggagt 4222 

aaacaatgtc aaagcaacag ateggegteg teggtatgge agtgatggga cgcaacctcg 4282 

cgctcaacat egaaagcegt ggttataccg tctctatttt caaccgttcc cgtgaaaaga 4342 

eggaagaagt tattgecgaa aatccaggca agaaactggt tccttactat acggtgaaag 4402 

agttcgttga atctcttgaa acgcctcgtc gcatcctgtt aatgggttaa ageaggtgea 4462 

ggcacggatg ctgctattga ttccctgaaa ccatatctcg ataaaggega tatcatcatt 4522 

gatgggtggg taataccttc tttcaggaca ccattcgtcg taaccgegag ctttctgcac 4582 
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aaggctttac ttcatcggta cc 



<210> 23 
<211> 247 
<212> PRT 

<213> Escherichia coli 
<400> 23 

Val Asp Asp Asn Leu Lys Gly Gin Gly Ala Gly Lys Asn Phe Leu Ser 
15 10 15 

Leu lie Lys Tyr Ser Glu Thr Asp Tyr Thr lie Tyr Cys Asp Gin Asp 
20 25 30 

Asp lie Trp Leu Glu Asn Lys He Phe Glu Leu Val Lys Tyr Ala Asn 
35 40 45 

Glu He Lys Leu Asn Val Ser Asp Ala Pro Ser Leu Val Tyr Ala Asp 
50 55 60 

Gly Tyr Ala Tyr Met Asp Gly Glu Gly Thr He Asp Phe Ser Gly He 
€5 70 75 80 

Ser Asn Asn His Ala Asp Gin Leu Lys Asp Phe Leu Phe Phe Asn Gly 
85 90 95 

Gly Tyr Gin Gly Cys Ser He Met Phe Asn Arg Ala Met Thr Lys Phe 
100 105 HO 

Leu Leu Asn Tyr Arg Gly Phe Val Tyr Leu His Asp Asp He Thr Thr 
115 120 125 

Leu Ala Ala Tyr Ala Leu Gly Lys Val Tyr Phe Leu Pro Lys Tyr Leu 
130 135 140 

Met Leu Tyr Arg Gin His Thr Asn Ala Val Thr Gly He Lys Thr Phe 
145 150 155 160 



Arg Asn Gly Leu Thr Ser Lys Phe 
165 

Ser Arg Lys His Tyr Gin Val Lys 
180 

Ser He Leu Ser Glu Thr Asn Lys 
195 200 



Lys Ser Pro Val Asn Tyr Leu Leu 
170 175 

Lys Ser Phe Phe Glu Cys Asn Ser 
185 190 

Lys Val Phe Leu Asp Phe He Ser 
205 
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Phe Cys Glu Ser Asn Asn Lys Phe Thr Asp Phe Phe Lys Leu Trp Arg 
210 215 220 

Gly Gly Phe Arg Leu Asn Asn Ser Arg Thr Lys Leu Leu Leu Lys Phe 
225 230 235 240 

Leu lie Arg Arg Lys Phe Ser 
245 



<210> 24 
<211> 261 
<212> PRT 

<213> Escherichia coli 
<400> 24 

Met He Ser He Leu Thr Pro Thr Phe Asn Arg Gin His Thr Leu Ser 



Arg Leu Phe Asn Ser Leu He Leu Gin Thr Asp Lys Asp Phe Glu Trp 

20 25 30 

He He He Asp Asp Gly Ser He Asp Ala Thr Ala Val Leu Val Glu 

35 40 45 

Asp Phe Arg Lys Lys Cys Asp Phe Asp Leu He Tyr Cys Tyr Gin Glu 

50 55 60 

Asn Asn Gly Lys Pro Met Ala Leu Asn Ala Gly Val Lys Ala Cys Arg 

65 70 75 80 

Gly Asp Tyr He Phe He Val Asp Ser Asp Asp Ala Leu Thr Pro Asp 

85 90 95 

Ala He Lys Leu He Lys Glu Ser He His Asp Cys Leu Ser Glu Lys 

100 105 no 

Glu Ser Phe Ser Gly Val Gly Phe Arg Lys Ala Tyr He Lys Gly Gly 

115 120 125 

He He Gly Asn Asp Leu Asn Asn Ser Ser Glu His He Tyr Tyr Leu 

130 135 - 140 

Asn Ala Thr Glu He Ser Asn Leu He Asn Gly Asp Val Ala Tyr Cys 

145 150 155 160 

Phe Lys Lys Glu Ser Leu Val Lys Asn Pro Phe Pro Arg He Glu Asp 
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165 170 175 

Glu Lys Phe Val Pro Glu Leu Tyr lie Trp Asn Lys lie Thr Asp Lys 
180 185 190 

Ala Lys lie Arg Phe Asn lie Ser Lys Val lie Tyr Leu Cys Glu Tyr 
195 200 205 

Leu Asp Asp Gly Leu Ser Lys Asn Phe His Asn Gin Leu Lys Lys Tyr 
210 215 220 

Pro Lys Gly Phe Lys lie Tyr Tyr Lys Asp Gin Arg Lys Arg Glu Lys 
225 230 235 240 

Thr Tyr He Lys Lys Thr Lys Met Leu He Arg Tyr Leu Gin Cys Cys 
245 250 255 

Tyr Tyr Glu Lys He 
260 



<210> 25 
<211> 368 
<212> PRT 

<213> Escherichia coli 
<400> 25 

Met Lys He Leu Phe Val He Thr Gly Leu Gly Leu Gly Gly Ala Glu 
15 10 15 

Lys Gin Val Cys Leu Leu Ala Asp Lys Leu Ser Leu Ser Gly His His 
20 25 30 

Val Lys He He Ser Leu Gly His Met Ser Asn Asn Lys Val Phe Pro 
35 40 45 

Ser Glu Asn Asn Val Asn Val lie Asn Val Asn Met Ser Lys Asn He 
50 55 60 

Ser Gly Val He Lys Gly Cys Val Arg He Arg Asp Val He Ala Asn 
65 70 75 80 

Phe Lys Pro Asp He Val His Ser His Met Phe His Ala Asn lie He 
85 90 95 

Thr Arg Leu Ser Val He Gly He Lys Asn Arg Pro Gly lie He Ser 
100 105 110 
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Thr Ala His Asn Lys Asn Glu Gly Gly Tyr Phe Arg Met Leu Thr Tyr 
115 120 125 

Arg lie Thr Asp Cys Leu Ser Asp Cys Cys Thr Asn Val Ser Lys Glu 
130 135 140 

Ala Val Asp Glu Phe Leu Arg lie Lys Ala Phe Asn Pro Ala Lys Ala 
145 150 155 160 

lie Thr Met Tyr Asn Gly lie Asp Thr Asn Lys Phe Lys Phe Asp Leu 
165 170 175 

Leu Ala Arg Arg Glu lie Arg Asp Gly lie Asn lie Lys Asn Asp Asp 
180 185 190 

lie Leu Leu Leu Ala Ala Gly Arg Leu Thr Leu Ala Lys Asp Tyr Pro 
195 200 205 

Asn Leu Leu Asn Ala Met Thr Leu Leu Pro Glu His Phe Lys Leu lie 
210 215 220 

lie He Gly Asp Gly Glu Leu Arg Asp Glu He Asn Met Leu He Lys 
225 230 235 240 

Lys Leu Gin Leu Ser Asn Arg Val Ser Leu Leu Gly Val Lys Lys Asn 
245 250 255 

He Ala Pro Tyr Phe Ser Ala Cys Asp He Phe Val Leu Ser Ser Arg 
260 265 270 

Trp Glu Gly Phe Gly Leu Val Val Ala Glu Ala Met Ser Cys Glu Arg 
275 280 285 

He Val Val Gly Thr Asp Ser Gly Gly Val Arg Glu Val He Gly Asp 
290 295 300 

Asp Asp Phe Leu Val Pro He Ser Asp Ser Thr Gin Leu Ala Ser Lys 
305 310 315 320 

He Glu Lys Leu Ser Leu Ser Gin He Arg Asp His He Gly Phe Arg 
325 330 .335 

Asn Arg Glu Arg He Leu Lys Asn Phe Ser He Asp Thr He He Met 
340 345 350 

Gin Trp Gin Glu Leu Tyr Gly Thr He He Cys Ser Lys His Glu Arg 
355 360 365 
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<210> 26 
<211> 367 
<212> PRT 

<213> Escherichia coli 
<400> 26 

Met Thr Ala Arg Thr Thr Lys Val Leu His Leu Gin Leu Leu Pro Leu 



Leu Ser Gly Val Gin Arg Val Thr Leu Asn Glu lie Ser Ala Leu Tyr 
20 25 30 

Thr Asp Tyr Asp Tyr Thr Leu Val Cys Ser Lys Lys Gly Pro Leu Thr 
35 40 45 

Lys Ala Leu Leu Glu Tyr Asp Val Asp Cys His Cys lie Pro Glu Leu 
50 55 60 

Thr Arg Glu lie Thr Val Lys Asn Asp Phe Lys Ala Leu Phe Lys Leu 
65 70 75 80 

Tyr Lys Phe lie Lys Lys Glu Lys Phe Asp lie Val His Thr His Ser 
85 90 95 

Ser Lys Thr Gly He Leu Gly Arg Val Ala Ala Lys Leu Ala Arg Val 
100 105 110 

Gly Lys Val He His Thr Val His Gly Phe Ser Phe Pro Ala Ala Ser 
115 120 125 

Ser Lys Lys Ser Tyr Tyr Leu Tyr Phe Phe Met Glu Trp He Ala Lys 
130 135 140 

Phe Phe Thr Asp Lys Leu He Val Leu Asn Val Asp Asp Glu Tyr He 
145 150 155 160 

Ala He Asn Lys Leu Lys Phe Lys Arg Asp Lys Val Phe Leu He Pro 
165 170 175 

Asn Gly Val Asp Thr Asp Lys Phe Ser Pro Leu Glu Asn Lys lie Tyr 
180 185 190 

Ser Ser Thr Leu Asn Leu Val Met Val Gly Arg Leu Ser Lys Gin Lys 
195 200 205 

Asp Pro Glu Thr Leu Leu Leu Ala Val Glu Lys Leu Leu Asn Glu Asn 
210 215 220 
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Val Asn Val Lys Leu Thr Leu Val Gly Asp Gly Glu Leu Lys Glu Gin 
225 230 235 240 

Leu Glu ser Arg Phe Lys Arg Gin Asp Gly Arg lie He Phe His Gly 
245 250 255 

Trp Ser Asp Asn He Val Asn He Leu Lys Val Asn Asp Leu Phe He 
260 265 270 

Leu Pro Ser Leu Trp Glu Gly Met Pro Leu Ala He Leu Glu Ala Leu 
275 280 285 

Ser Cys Gly Leu Pro Cys He Val Thr Asn He Pro Gly Asn Asn Ser 
290 295 300 

Leu He Glu Asp Gly Tyr Asn Gly Cys Leu Phe Glu He Arg Asp Cys 
305 310 315 320 

Gin Leu Leu Ser Gin Lys He Met Ser Tyr Val Gly Lys Pro Glu Leu 
325 330 335 

He Ala Gin Gin Ser Thr Asn Ala Arg Ser Phe He Leu Lys Asn Tyr 
340 345 350 

Gly Leu Val Lys Arg Asn Asn Lys Val Arg Gin Leu Tyr Asp Asn 
355 360 365 



<210> 27 
<211> 1272 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (319) (1269) 

<220> 

<221> CDS 

<222> (3) . . (215) 

<400> 27 

cc ggg aag cac teg gcg ctg att gtt gca cat cgt ctg acc acc gcg 47 

Gly Lys His Ser Ala Leu He Val Ala His Arg Leu Thr Thr Ala 
15 10 15 
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caa cgc tgc gat ctg att gcc gtt att gat aag ggg tta ctt gcg gaa 95 
Gin Arg Cys Asp Leu lie Ala Val lie Asp Lys Gly Leu Leu Ala Glu 
20 25 30 



tac gga acc cac gaa cag ctg tta tct gcg ggc ggc etc tat acc cgc 
Tyr Gly Thr His Glu Gin Leu Leu Ser Ala Gly Gly Leu Tyr Thr Arg 
35 40 45 



tta tgg cat gac age gtc age agt act get etc cat cgc cag cac aac 191 
Leu Trp His Asp Ser Val Ser Ser Thr Ala Leu His Arg Gin His Asn 



atg aag gag gaa acc ccg gga tag ttactggaca cgtaatgtat taaaaacaca 245 
Met Lys Glu Glu Thr Pro Gly 
65 70 



gtcagaagcg gcggtaccgt gaatagcege tttaattatt tatactgaca tccttaattt 305 



ttaaagagta tga atg ctg aac atg caa caa cat etc tct get ate gcc 354 
Met Leu Asn Met Gin Gin His Leu Ser Ala He Ala 
75 80 



age ctg cgc aac caa ctg gca gcg ggc cac att get aac ctt act gac 402 
Ser Leu Arg Asn Gin Leu Ala Ala Gly His He Ala Asn Leu Thr Asp 
85 90 95 



ttc tgg cgc gaa get gag teg ctg aat gtt cct ctt gtg acg cca gtc 450 
Phe Trp Arg Glu Ala Glu Ser Leu Asn Val Pro Leu Val Thr Pro Val 
100 105 110 115 



gaa gga gcg gaa gat gag cga gaa gtg acc ttt ctg tgg cgc gcc cga 498 
Glu Gly Ala Glu Asp Glu Arg Glu Val Thr Phe Leu Trp Arg Ala Arg 
120 125 130 



cat cct ctg cag ggc gtt tat ctg cgt ctg aac egg gtg acg gat aaa 546 
His Pro Leu Gin Gly Val Tyr Leu Arg Leu Asn Arg Val Thr Asp Lys 
135 140 145 



gag cac gta gaa aaa gga atg atg age gcc ctt ccc gaa acg gat ate 594 
Glu His Val Glu Lys Gly Met Met Ser Ala Leu Pro Glu Thr Asp He 
150 155 160 



tgg aca ctg aca ctg cgt tta ccc gca agt tac tgc ggc tec tat teg 642 
Trp Thr Leu Thr Leu Arg Leu Pro Ala Ser Tyr Cys Gly Ser Tyr Ser 
165 170 175 



ctg ctg gaa ate ccc ccc ggc act acg get gag acg att gca ctg tec 690 
Leu Leu Glu He Pro Pro Gly Thr Thr Ala Glu Thr He Ala Leu Ser 
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180 185 190 195 

gga ggc cgt ttt gcc acc ctt gcc gga aag gcc gat ccg eta aac aaa 
Gly Gly Arg Phe Ala Thr Leu Ala Gly Lys Ala Asp Pro Leu Asn Lys 
200 205 210 

atg ccg gag ate aac gtt egg gga aac gca aag gaa tea gtg ctg aca 
Met Pro Glu lie Asn Val Arg Gly Asn Ala Lys Glu Ser Val Leu Thr 
215 220 225 

ctt gat aaa get ccc gcc ctg teg gaa tgg aac ggc ggc ttc cac acc 
Leu Asp Lys Ala Pro Ala Leu Ser Glu Trp Asn Gly Gly Phe His Thr 
230 235 240 

gga caa ctg ctt acc tec atg cgc att ate gcc ggg aaa tct cgc cag 
Gly Gin Leu Leu Thr Ser Met Arg lie lie Ala Gly Lys Ser Arg Gin 
245 250 255 

gtt egg etc tat att ccg gat gtt gat att tct cag ccc etc ggg ctg 
Val Arg Leu Tyr lie Pro Asp Val Asp lie Ser Gin Pro Leu Gly Leu 
260 265 270 275 

gtc gtg ctg ccc gat ggt gaa acc tgg ttt gat cac ctt ggc gta tgc 
Val Val Leu Pro Asp Gly Glu Thr Trp Phe Asp His Leu Gly Val Cys 
280 285 290 

gcg gca att gac gcc gcc ata aat aat ggg cgc ate gtg ccc gtg get 
Ala Ala He Asp Ala Ala lie Asn Asn Gly Arg He Val Pro Val Ala 
295 300 305 

gta ctg ggc att gac aac att aat gaa cat gaa cgc act gag ata etc 
Val Leu Gly He Asp Asn He Asn Glu His Glu Arg Thr Glu He Leu 
310 315 320 

ggc ggg cgc age aaa ctg ata aag gat ate gcc gga cat ctg ctg ccg 
Gly Gly Arg Ser Lys Leu He Lys Asp He Ala Gly His Leu Leu Pro 
325 330 335 

atg att cgc get gaa caa ccg cag cgt cag tgg gca gac cgt teg cgc 
Met He Arg Ala Glu Gin Pro Gin Arg Gin Trp Ala Asp Arg Ser Arg 
340 345 350 355 

aca gtg ctg gcc ggg cag age etc ggc ggg ate agt gcg eta atg ggg 
Thr Val Leu Ala Gly Gin Ser Leu Gly Gly He Ser Ala Leu Met Gly 
360 365 370 

get cgt tac gca ccg gaa acg ttc ggt ctg gtg etc age cac tct cct 
Ala Arg Tyr Ala Pro Glu Thr Phe Gly Leu Val Leu Ser His Ser Pro 
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caa tgc 
Gin 



<210> 28 
<211> 70 
<212> PRT 

<213> Escherichia coli 
<400> 28 

Gly Lys His Ser Ala Leu lie Val Ala His Arg Leu Thr Thr Ala Gin 
15 10 15 

Arg Cys Asp Leu lie Ala Val lie Asp Lys Gly Leu Leu Ala Glu Tyr 
20 25 30 

Gly Thr His Glu Gin Leu Leu Ser Ala Gly Gly Leu Tyr Thr Arg Leu 
35 40 45 

Trp His Asp Ser Val Ser Ser Thr Ala Leu His Arg Gin His Asn Met 
50 55 60 

Lys Glu Glu Thr Pro Gly 
65 70 



<210> 29 
<211> 317 
<212> PRT 

<213> Escherichia coli 
<400> 29 

Met Leu Asn Met Gin Gin His Leu Ser Ala lie Ala Ser Leu Arg Asn 
15 10 15 

Gin Leu Ala Ala Gly His lie Ala Asn Leu Thr Asp Phe Trp Arg Glu 
20 25 30 

Ala Glu Ser Leu Asn Val Pro Leu Val Thr Pro Val Glu Gly Ala Glu 
35 40 45 

Asp Glu Arg Glu Val Thr Phe Leu Trp Arg Ala Arg His Pro Leu Gin 
50 55 60 

Gly Val Tyr Leu Arg Leu Asn Arg Val Thr Asp Lys Glu His Val Glu 
65 70 75 80 
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Lys Gly Met Met Ser Ala Leu Pro Glu Thr Asp lie Trp Thr Leu Thr 
85 90 95 

Leu Arg Leu Pro Ala Ser Tyr Cys Gly Ser Tyr Ser Leu Leu Glu lie 
100 105 no 

Pro Pro Gly Thr Thr Ala Glu Thr lie Ala Leu Ser Gly Gly Arg Phe 
115 120 125 

Ala Thr Leu Ala Gly Lys Ala Asp Pro Leu Asn Lys Met Pro Glu He 
130 135 140 

Asn Val Arg Gly Asn Ala Lys Glu Ser Val Leu Thr Leu Asp Lys Ala 
145 150 155 160 

Pro Ala Leu Ser Glu Trp Asn Gly Gly Phe His Thr Gly Gin Leu Leu 
165 170 175 

Thr Ser Met Arg He He Ala Gly Lys Ser Arg Gin Val Arg Leu Tyr 
180 185 190 

He Pro Asp Val Asp He Ser Gin Pro Leu Gly Leu Val Val Leu Pro 
195 200 205 

Asp Gly Glu Thr Trp Phe Asp His Leu Gly Val Cys Ala Ala He Asp 
210 215 220 

Ala Ala He Asn Asn Gly Arg He Val Pro Val Ala Val Leu Gly He 
225 230 235 240 

Asp Asn He Asn Glu His Glu Arg Thr Glu He Leu Gly Gly Arg Ser 
245 250 255 

Lys Leu He Lys Asp He Ala Gly His Leu Leu Pro Met He Arg Ala 
260 265 270 

Glu Gin Pro Gin Arg Gin Trp Ala Asp Arg Ser Arg Thr Val Leu Ala 
275 280 285 

Gly Gin Ser Leu Gly Gly He Ser Ala Leu Met Gly Ala Arg Tyr Ala 
290 295 300 

Pro Glu Thr Phe Gly Leu Val Leu Ser His Ser Pro Gin 
305 310 315 
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<210> 30 
<211> 4039 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . (285) 

<220> 
<221> CDS 

<222> (370) . . (1326) 



<400> 30 

cct tea atg tgg tgg acg cca gaa aga acc agt cga cca ggc ttg ttc 48 

Pro Ser Met Trp Trp Thr Pro Glu Arg Thr Ser Arg Pro Gly Leu Phe 
15 10 15 



age gaa acc gat acc tea tgg gtg agt gag cat ctg ctt tct gec cca 
Ser Glu Thr Asp Thr Ser Trp Val Ser Glu His Leu Leu Ser Ala Pro 
20 25 30 

ccg cag ggc gta cgt ate age ctg tgc gtg gga teg ctg gaa ggt teg 
Pro Gin Gly Val Arg lie Ser Leu Cys Val Gly Ser Leu Glu Gly Ser 



aca gtg cct cac gtt cag cag ctt cac cag egg ctg att acc get ggc 192 
Thr Val Pro His Val Gin Gin Leu His Gin Arg Leu lie Thr Ala Gly 
50 55 60 

gtc gaa age cat tgc gca ate tac acc ggt ggt cac gat tac gca tgg 240 
Val Glu Ser His Cys Ala lie Tyr Thr Gly Gly His Asp Tyr Ala Trp 
65 70 75 80 

tgg cgc ggt gca ctg att gac ggg att ggt tta eta cag ggt tga 285 
Trp Arg Gly Ala Leu lie Asp Gly lie Gly Leu Leu Gin Gly 

85 90 95 

gttgacccac aaacactttc aggaaaeggt acagacttcc tgaataaatc aaatagtcac 345 

ctgcggaaaa ggaataatca tcag atg tat gec cgc gag tat cgc tea aca 396 
Met Tyr Ala Arg Glu Tyr Arg Ser Thr 
100 

cgc ccg cat aaa gcg att ttc ttt cat ctt tct tgc etc acc ctt ate 444 
Arg Pro His Lys Ala He Phe Phe His Leu Ser Cys Leu Thr Leu He 
105 110 115 120 
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tgt agt gcg caa gtt tat gcg aag ccg gat atg egg cca ctg ggg ccg 
Cys Ser Ala Gin Val Tyr Ala Lys Pro Asp Met Arg Pro Leu Gly Pro 
125 130 135 

aat ata gec gat aaa ggc tec gtg ttt tac cat ttc age gtc acc tct 
Asn He Ala Asp Lys Gly Ser Val Phe Tyr His Phe Ser Val Thr Ser 
140 145 150 

ttc gac tct gtc gat ggc aca cgc cat tat egg gta tgg acg gec gtg 
Phe Asp Ser Val Asp Gly Thr Arg His Tyr Arg Val Trp Thr Ala Val 
155 160 165 

ccg aat aca acc gca ccg gca teg ggt tac ccg att tta tat atg ctt 
Pro Asn Thr Thr Ala Pro Ala Ser Gly Tyr Pro He Leu Tyr Met Leu 
170 175 180 

gac ggt aac gca gtt atg gat cgc ctg gat gac gaa ctg etc aaa caa 
Asp Gly Asn Ala Val Met Asp Arg Leu Asp Asp Glu Leu Leu Lys Gin 
185 190 195 200 

ttg tea gaa aaa aca ccg cca gtg ate gtg get gtc ggg tat cag acc 
Leu Ser Glu Lys Thr Pro Pro Val He Val Ala Val Gly Tyr Gin Thr 
205 210 215 

aac etc cct ttc gat etc aac age agg get tac gac tat acg cca gca 
Asn Leu Pro Phe Asp Leu Asn Ser Arg Ala Tyr Asp Tyr Thr Pro Ala 
220 225 230 

gca gaa age aga aaa aca gat etc cac tea ggg cgt ttt age cgt aag 
Ala Glu Ser Arg Lys Thr Asp Leu His Ser Gly Arg Phe Ser Arg Lys 
235 240 245 

agt ggt ggc age aac aac ttc cgc cag tta ctg gaa acg cgt att gee 
Ser Gly Gly Ser Asn Asn Phe Arg Gin Leu Leu Glu Thr Arg He Ala 
250 255 260 

cca aaa gtg gaa cag gga ctg aat ate gat egg caa cgc cgc ggc tta 
Pro Lys Val Glu Gin Gly Leu Asn He Asp Arg Gin Arg Arg Gly Leu 
265 270 275 280 

tgg ggg cac tec tac gg° gg c ctc ttc gtg °tg gat tec tgg ctg tec 

Trp Gly His Ser Tyr Gly Gly Leu Phe Val Leu Asp Ser Trp Leu Ser 
285 290 295 

tcc tct tac ttc egg teg tac tac age gec age ccg teg ttg ggc aga 
Ser Ser Tyr Phe Arg Ser Tyr Tyr Ser Ala Ser Pro Ser Leu Gly Arg 
300 305 310 
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ggt tat gat get ttg eta age cgc gtt acg gcg gtt gag cct ctg caa 
Gly Tyr Asp Ala Leu Leu Ser Arg Val Thr Ala Val Glu Pro Leu Gin 
315 320 325 

ttc tgc gec aaa cac ctg gcg ata atg gaa ggc teg gcg aca cag ggt 
Phe cys Ala Lys His Leu Ala lie Met Glu Gly Ser Ala Thr Gin Gly 
330 335 340 

gat aac egg gaa acg cat get gtc ggg gtg ctg teg aaa att cat ace 
Asp Asn Arg Glu Thr His Ala Val Gly Val Leu Ser Lys He His Thr 
345 350 355 360 

acc etc act ata ctg aaa gat aaa ggc gtc aat gec gta ttt tgg gat 
Thr Leu Thr He Leu Lys Asp Lys Gly Val Asn Ala Val Phe Trp Asp 
365 370 375 

ttc ccc aac ctg gga cac ggg ccg atg ttc aat gec tec ttt cgc cag 
Phe Pro Asn Leu Gly His Gly Pro Met Phe Asn Ala Ser Phe Arg Gin 
380 385 390 

gca ctg tta gat ate agt ggt gaa aac gca aat tac aca gca ggt tgt 
Ala Leu Leu Asp He Ser Gly Glu Asn Ala Asn Tyr Thr Ala Gly Cys 
395 400 405 

cat gag tta age cac taa acactgcccg ettttacgeg ggcagtaege 
His Glu Leu Ser His 
410 

ctgaaacact acgatcagaa tgatgcggta actccggcat agtaagcccg gcctggctcg 1416 
ttataggtat tcgccccttc agaagategg aagatctgtt tattgaggat attactgacg 1476 
ccgacattaa gaegcagatt tttattaata tegtaattga agttcgtccc caccagtgaa 1536 
taagcgccca gctctttacc tgacagaccg ccagtatctt cactgegggt ttccgcatga 1596 
gtacgcggtt tttgtctgcc atataaegtc cagttgaege tggcagaaaa cgcctgggtg 1656 
atggtccagt taagcgagtt attgatagta tattteggga tgaccgacag aggattaccg 1716 
gtgtcttttt gctccgaagt gatcatccat gtggcattgg tattccagtt cagacgatct 177 6 
ttcaccagtg ggaaagacat actggcttcg ataccgtcca ccagagcttt cccgccattc 1836 
tgccacttga ggatatatgc gectgaageg gtttgecega taacgttatc cccggccacg 1896 
atcttattct ggtaatcatt geggaagtag gtcacacttg cgtggtaatc ttcccaggtg 1956 
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aactccagcc caatttcttt attgacgctg 
aggtagcacc cgcctgatgt aatatctttt 
tagccttcac tggattgata caggtttggg 
actttgaaat aatcgcccaa ttcctgcgaa 
gagtcgctga gataatcaaa gcgcaggccg 
atgttatctt caatatacag cgcactgatt 
gcagagccag aaataccgct gatatcactg 
tcatcgagct tatcgcggtt ccactctgca 
ttcaaaggaa tattaagctc gccgctggtt 
ttttcacccg ctaaaatacg cccttcacca 
ttattggttt tctcgtaata aacaccaaag 
tgcgtaatgc cataattctg tcggtacagg 
gcttttcggt aactgcactg gaagaactgt 
gcggctatat ccggcttcga aatcgagaat 
gccgttaata tctttgttac gtaccccttc 
agaattaata tcccaactgt cagcatccgt 
aagagcatta ccagccagag gcccactaag 
atcgctactt tccggctgat tggtgtataa 
acgtttggta atgatgttca ccaccccccc 
agggccgcgg atcacttcaa tacgctcaac 
gcgggtatca cgctctccac gccagctata 
accatcaatt aaaattaagg tgttttccgg 
gttaccgcgt gtgcccgagg cgctattgcc 
aatatctgaa aggtcgttta ccggaggggt 
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atttccggat cgagatcttt attaccgatc 2016 
ggacagccat tgcctttcga gtagagcaga 2076 
gctttaaagg ttcgggcaac ccctgctttg 2136 
agattcagac tggggctgaa gttcccgccg 2196 
ggaatgatat tcgtgccagg aaccggctca 2256 
tgagaatgat ttttactgct gcgatccgca 2316 
tcattcaccg tcaggctggt agaggaagga 2376 
ccaacggtca gcgtttgatc aaccatcaca 2436 
cgccaggaac tcaggcgatt ggtcgtaaac 2496 
ccgccggata atccttcatt catgcgggta 2556 
cgactttgtc cccagtccca gataccatta 2616 
cggctcgtct cttgtgccgg attttgccag 2676 
tttgcgtatc gccgcataga tattcccctg 2736 
ctgctgcgga tttaatttcc acgagacaac 2796 
atgcccggct gcgtttttcg taccgaccgg 2856 
tttattcaga ttaccataca aacgcgtggt 2916 
gctgaaattg gcgcgacgcg tagcgccctc 2976 
cgacagcgaa ccgtgccagt cgttggtggg 303 6 
ggctgccccc gaaccgtagc gcgccgccgc 3096 
ctgttccggt ggcacccagt tggtgtcacc 3156 
acgcacggag ttacgtgacg tcaccggtac 3216 
ccccatacca cgaatatcga tctggcggtt 3276 
ggtaagattg acgccaggca ttttacgaat 3336 
ctttttaata tcctcgctgg taataaccga 3396 
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cacgcccggc tgctgtttta atacctgctc agcggtggct tccaccacca gagtctcgtc 3456 

attatcatcg tcggaggatt tggctactga tacctggcta ttcaacccaa ccaggagcac 3516 

agttagcgac cagaggattt tgttaattct catacctatt ccctaataaa tgcctaactt 3576 

aaaatgtttg atcgttaagc tcacatcctt gccagatatt ttttactgcc attattgttt 3636 

ttatataaga atgataatta atatcattta gcaaaagaaa aagcaatccc tcacaagata 3696 

aatatatcga tttttcataa atatcaaatt gatatataac atatgttttt tatttcattg 3756 

tacttcagtc aaataaattt ctgaagcact gctagtagtg ccagttcagc tttctttttg 3816 

actcattccg gcaaagtcag taccgttcat cttttgtact gatgttgcca ctggaaaatc 3876 

ggtgcgcttg tcgatcatcg ggaattttgt cacaatttct aacggatagt gttcacattg 3936 

tttctaacct gcattttcag acacgggcgc tgcttatgta tataagatca gcatcactag 3996 

gtctttctgc aacactactg ctttcaacaa ggtcaggcat ttc 4039 



<210> 31 
<211> 94 
<212> PRT 

<213> Escherichia coli 
<400> 31 

Pro Ser Met Trp Trp Thr Pro Glu Arg Thr Ser Arg Pro Gly Leu Phe 
15 10 15 

Ser Glu Thr Asp Thr Ser Trp Val Ser Glu His Leu Leu Ser Ala Pro 



Pro Gin Gly Val Arg He Ser Leu Cys Val Gly Ser Leu Glu Gly Ser 
35 40 45 

Thr Val Pro His Val Gin Gin Leu His Gin Arg Leu He Thr Ala Gly 
50 55 60 

Val Glu Ser His Cys Ala He Tyr Thr Gly Gly His Asp Tyr Ala Trp 
65 70 75 80 

Trp Arg Gly Ala Leu He Asp Gly He Gly Leu Leu Gin Gly 
85 90 
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<210> 32 
<211> 318 
<212> PRT 

<213> Escherichia coli 
<400> 32 

Met Tyr Ala Arg Glu Tyr Arg Ser Thr Arg Pro His Lys Ala He Phe 



Phe His Leu Ser Cys Leu Thr Leu He Cys Ser Ala Gin Val Tyr Ala 
20 25 30 

Lys Pro Asp Met Arg Pro Leu Gly Pro Asn He Ala Asp Lys Gly Ser 



Val Phe Tyr His Phe Ser Val Thr Ser Phe Asp Ser Val Asp Gly Thr 
50 55 60 

Arg His Tyr Arg Val Trp Thr Ala Val Pro Asn Thr Thr Ala Pro Ala 
65 70 75 80 

Ser Gly Tyr Pro He Leu Tyr Met Leu Asp Gly Asn Ala Val Met Asp 



Arg Leu Asp Asp Glu Leu Leu Lys Gin Leu Ser Glu Lys Thr Pro Pro 
100 105 HO 

Val He Val Ala Val Gly Tyr Gin Thr Asn Leu Pro Phe Asp Leu Asn 
115 120 125 

Ser Arg Ala Tyr Asp Tyr Thr Pro Ala Ala Glu Ser Arg Lys Thr Asp 
130 135 140 

Leu His Ser Gly Arg Phe Ser Arg Lys Ser Gly Gly Ser Asn Asn Phe 
145 150 155 160 

Arg Gin Leu Leu Glu Thr Arg He Ala Pro Lys Val Glu Gin Gly Leu 
165 170 175 

Asn He Asp Arg Gin Arg Arg Gly Leu Trp Gly His Ser Tyr Gly Gly 
180 185 190 

Leu Phe Val Leu Asp Ser Trp Leu Ser Ser Ser Tyr Phe Arg Ser Tyr 
195 200 205 

Tyr ser Ala Ser Pro Ser Leu Gly Arg Gly Tyr Asp Ala Leu Leu Ser 
210 215 220 
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Arg Val Thr Ala Val Glu Pro Leu 
225 230 

lie Met Glu Gly Ser Ala Thr Gin 
245 

Val Gly Val Leu Ser Lys He His 
260 

Lys Gly Val Asn Ala Val Phe Trp 
275 280 

Pro Met Phe Asn Ala Ser Phe Arg 
290 295 

Glu Asn Ala Asn Tyr Thr Ala Gly 
305 310 



Gin Phe Cys Ala Lys His Leu Ala 
235 240 

Gly Asp Asn Arg Glu Thr His Ala 
250 255 

Thr Thr Leu Thr He Leu Lys Asp 
265 270 

Asp Phe Pro Asn Leu Gly His Gly 
285 

Gin Ala Leu Leu Asp lie Ser Gly 
300 

Cys His Glu Leu Ser His 
315 



<210> 33 
<211> 3292 
<212> DMA 

<213> Escherichia coli 



<400> 33 

ccgctgcggt tgattgccgg atgcggcgtg aacgccttat ccggcctaca atcattgcaa 60 
attcaataaa ttgcagcgtt ctgtaggctg gataagatgc gtcagcatcg catccggcaa 120 
aggcagatct cagcgatagc gccggcttag tcagatttaa tctgcgcgcg tggtggatat 180 
tttttcagga tctccatata cgcgtgcatt tcggtctgta gcggtacacc catcggaata 240 
tggcgcacgc cgatagagtc gctttcctgc ggatcggtgt acaggttaaa caccgacgat 300 
cccgccgttt gcattactgt gccggtgaat ccaccctgat atccgctctg ggtataagcg 360 
taaggttgct gaatcaggac gtgatacttg aacteatcca tacgcacagc agaaagttta 420 
ccgttgagga agtagtgctc ggccttacgg ttagactgac catttgttcc caggaagaaa 480 
gatgtctggt ccacaccatc gataaaggtg gttttcggta ctaaattcgc tactttcgcc 540 
ccaggatgcc ccgccagatc cagcgccgta gggaagagat ccgccagatc gacaatgccg 600 
tcagatttac gcggttggat catgcctttc cagtaaacga aggttggcac acgaacgccg 660 
ccttcccatg tagaaccttt cgcaccgcgg aacggcgtgc gaccgtgcgg cggaacttcg 720 
gcttccggac cgttatcaga ggtgaacaca atcagcgtgt tatcaagctg accgtttttc 780 
tccagtgctt tatacagatt ggcgaagata tcgttcatct ccaccatgca gtcgccgtaa 840 
gaggtgcgcg ccggagagct acccgcatat ttggcgttcg ggtagttatc gaagtgacag 900 
ccgcgagtgc cgtaatagag gaagaaaggc ttatcactct tcgccatctt gtcgaggaac 960 
ttaacgccat attccatcca gcgttgatcc aggtcttcca tatatttcgg cgtaatgtcg 1020 
gcaatggctt cctgttcacc gccgcgcacg gcatgaacgt catctttgct gaacggcagt 1080 
ttttggatgt attcagaacg agccgggctc agggcgactt ccgggttgac atgaacatca 1140 
cgccattcgg tatacatatc ggacaccgag ttaaagccgc ggaaatcatc aaagccaacg 1200 
ttctgcggct gcgactcttt gttttccccc atatgccatt tcccgatagc ctgggtgacg 1260 
tagccctgat cgtgcagcaa ctgcggcagc gtggttaacc cttgcagacc gcccggttgc 1320 
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ccgtacattg gcggcatcag aatgccgtgg tggatggagt attgtccggt gagaatcgtg 1380 
gcgcgggttg gggaagaact cggttgagaa tacgccgaag ttaaaatcag cccctggctg 1440 
gcaacggcgt cgatatctgg tgtagggtta cccaccgcca cgccgccgcc gttaaagcca 1500 
acgtccatcc agcccacatc atccagcaag aaaacaacca cgttcggttt cttaccggtt 1560 
tttttctcaa gttccgccag tttctgctgg gtttctttgt cttgtgcagg atgctgcatc 1620 
accggcatca tgttgtcggc aatagtggtc gccggtttaa ccagatactg gtttgggtga 1680 
tcgtatccgg caaagccttt gcgtgcggtg gcggttgacg gggtatctgc tgcgctggct 1740 
atgagaggaa gagctgcggc gacagcaaca acaagaagtt tgggtgaaaa cgaaaattcc 1800 
atgcaaaatg ctccggtttc atgtcgtcaa aatgatgacg taattaagca ttgataattg 1860 
agatccctct ccctgacagg atgattgcat aaataatagt gataaaaata aattatttat 1920 
ttatccagaa atgaattgaa aaatcaggag agcattttca atcctacctc tggcgcaggt 1980 
gatattgtaa ggcggtgatg ttatatcgcg ttgattattg atgctgtttt tagttttaac 2040 
ggcattaata tatatgttat taattgaatg acttttatta ttcattatat atatgtgtag 2100 
aattgtgcgc aggagaaata ttcactcagg aagttattac tcaggaagca aagaggatta 2160 
cagaattatc tcataacaag tgttaaggga tgttatttcc cggttctctg tggcataata 2220 
aacgagtaga tgctcattcc atctcttatg ttcgccttag tgcctcataa actccggaat 2280 
gacgcagagc cgtttacggt gcttatcgtc cactgacaga tgtcgcttat gcctcatcag 2340 
acaccatgga cacaacgttg agtgaagcac ccacttgttg tcatacagac ctgttttaac 2400 
gcctgctccg taataagagc aggcgttttt ttatatatca gaaaggcccc ggaggtgctt 2460 
gcctccgggt gagaaagagc tactgtggcg ggttgttctg caacgttaac atcaaaccgt 2520 
cgcgacgcat cgctgcggct tcttccggct tgtgcagtct gtccagcgcg tcggcaagcc 2580 
atgcgtaatc gtaggcgtcc ggacgttgtt tcagcgctgc gcggaaggcg agcgatgctt 2640 
cctgccattc tccgtgtttc atcagcgact gacccagtgt gctccacaac agcgggcgat 2700 
cgcccacgtt tttgatctgt tggcgcagca ctttttccaa gctgttcccg ggttaattgg 2760 
gttttcagac gccggggatc ggcagcagca ggccgaatcg tcatactggc gtttcaggcc 2820 
atcgatgata atttgctggg cagtatcatg atcgtcacac tcaataagat gttccgccat 2880 
tgccacctgc aaggccacct gatgacgcgt tttccggctt tggtttttcc accagttacg 2940 
caaaccttca ctaccgttat cggcacgcgc ctgatccatc aggccaatcc atgcctgttg 3000 
ttccagcatt gcacgatgtt cttcatcacc aacatgggct ttcgccattg atgggatgat 3060 
atccagcagc gaactccatg caccagtgcg gatatacgcc tgttccgcca gacgtaatac 3120 
ttccggatgg cgtggcgtaa cttccagcag cttatccacg ccgtgacgtg ctgcatggtt 3180 
ttcattacgg gccagttgca gacgtacacg ggtgatttct accggaatgg tgtcattgcc 3240 
ggccagctcc gctgcgcgtt ccagatgttg gttggcgcgt gcttcagcat cc 3292 



<210> 34 
<211> 11165 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (3791) . . (4834) 

<220> 
<221> CDS 

<222> (10459) .. (10776) 
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<220> 
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<222> (10134) . . (10427) 

<220> 
<221> CDS 

<222> (9836) (10081) 

<220> 
<221> CDS 

<222> (7816) . . (9480) 

<220> 
<221> CDS 

<222> (4878) . . (7802) 

<220> 
<221> CDS 

<222> (3460) . . (3702) 

<220> 
<221> CDS 
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<220> 
<221> CDS 

<222> (2613) . . (3041) 

<220> 
<221> CDS 

<222> (2198) . . (2530) 

<220> 
<221> CDS 

<222> (1939) . . (2196) 

<220> 
<221> CDS 

<222> (1573) . . (1893) 

<220> 
<221> CDS 

<222> (1102) . . (1485) 

<220> 

<221> CDS 

<222> (2) . . (1099) 
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c age gat atg cag cgc ggt ate cag get gca acg get gca ctt cag ggc 49 

Ser Asp Met Gin Arg Gly lie Gin Ala Ala Thr Ala Ala Leu Gin Gly 

15 10 15 

ctg gtg ggc ggc aat atg gca ggc gcg ctg gca ggt get tea gcg ccg 97 
Leu Val Gly Gly Asn Met Ala Gly Ala Leu Ala Gly Ala Ser Ala Pro 
20 25 30 

gag ctg gcg aac ate ate ggt cat cac gcg ggt att gat gac aat aca 145 
Glu Leu Ala Asn lie lie Gly His His Ala Gly lie Asp Asp Asn Thr 
35 40 45 



gcg gca aaa gec att gec cat gec att etc ggt ggt gtg aca gca gec 193 
Ala Ala Lys Ala lie Ala His Ala lie Leu Gly Gly Val Thr Ala Ala 
50 55 60 

ctt cag ggc aac agt gcg gca gca ggc gca att ggt gcg ggt act ggt 241 
Leu Gin Gly Asn Ser Ala Ala Ala Gly Ala lie Gly Ala Gly Thr Gly 
65 70 75 80 

gaa gtg ate gcg tea gec att gcg aaa age etc tac ccg ggc gta gat 289 
Glu Val He Ala Ser Ala He Ala Lys Ser Leu Tyr Pro Gly Val Asp 
85 90 95 

ccg teg aaa ctg aca gaa gat cag aag caa act gta age acg ctg gca 337 
Pro Ser Lys Leu Thr Glu Asp Gin Lys Gin Thr Val Ser Thr Leu Ala 
100 105 110 

acg ctg tea gcg ggt atg gec ggc ggc att gec agt ggc gat gtg get 385 
Thr Leu Ser Ala Gly Met Ala Gly Gly He Ala Ser Gly Asp Val Ala 
115 120 125 

ggc gcg get get gga get ggt gee ggg aag aac gtt gtt gag aat aat 433 
Gly Ala Ala Ala Gly Ala Gly Ala Gly Lys Asn Val Val Glu Asn Asn 
130 135 140 

gcg ctg agt ctg gtt gec aga ggc tgt gcg gtc gca gca cct tgc agg 481 
Ala Leu Ser Leu Val Ala Arg Gly Cys Ala Val Ala Ala Pro Cys Arg 
145 150 155 160 

act aaa gtt gca gag cag ttg eta gaa ate ggg gcg aaa gcg ggc atg 529 
Thr Lys Val Ala Glu Gin Leu Leu Glu He Gly Ala Lys Ala Gly Met 
165 170 175 

gec ggg ctt gec ggg gcg gca gtc aag gat atg gec gac agg atg ace 577 
Ala Gly Leu Ala Gly Ala Ala Val Lys Asp Met Ala Asp Arg Met Thr 
180 185 190 
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tec gat gaa ctg gag cat ctg att acc ctg caa atg atg ggt aat gat 
Ser Asp Glu Leu Glu His Leu He Thr Leu Gin Met Met Gly Asn Asp 
195 200 205 

gag ate act act aag tat etc agt teg ttg cat gat aag tac ggt tec 
Glu He Thr Thr Lys Tyr Leu Ser Ser Leu His Asp Lys Tyr Gly Ser 
210 215 220 

ggg get gee teg aat ccg aat ate ggt aaa gat ctg acc gat gcg gaa 
Gly Ala Ala Ser Asn Pro Asn He Gly Lys Asp Leu Thr Asp Ala Glu 
225 230 235 240 

aaa gta gaa ctg ggc ggt tec ggc tea gga acc ggt aca cca cca cca 
Lys Val Glu Leu Gly Gly Ser Gly Ser Gly Thr Gly Thr Pro Pro Pro 
245 250 255 

teg gaa aat gat cct aag cag caa aat gaa aaa act gta gat aag ctt 
Ser Glu Asn Asp Pro Lys Gin Gin Asn Glu Lys Thr Val Asp Lys Leu 
260 265 270 

aat cag aag caa gaa agt gcg att aag aag ate gat aac act ata aaa 
Asn Gin Lys Gin Glu Ser Ala He Lys Lys He Asp Asn Thr He Lys 
275 280 285 

aat get ctg aaa gat cat gat att att gga act etc aag gat atg gat 
Asn Ala Leu Lys Asp His Asp He He Gly Thr Leu Lys Asp Met Asp 
290 295 300 

ggt aag cca gtt cct aaa gag aat gga gga tat tgg gat cat atg cag 
Gly Lys Pro Val Pro Lys Glu Asn Gly Gly Tyr Trp Asp His Met Gin 
305 310 315 320 

gaa atg caa aat acg etc aga gga tta aga aat cat gcg gat acg ttg 
Glu Met Gin Asn Thr Leu Arg Gly Leu Arg Asn His Ala Asp Thr Leu 
325 330 335 

aaa aac gtc aac aat cct gaa get cag get gcg tat ggc aga gca aca 
Lys Asn Val Asn Asn Pro Glu Ala Gin Ala Ala Tyr Gly Arg Ala Thr 
340 345 350 

gat get att aat aaa ata gaa tea gec ttg aaa gga tat gga at atg 
Asp Ala He Asn Lys He Glu Ser Ala Leu Lys Gly Tyr Gly Met 
355 360 365 

att acc tta cgt aaa ttg att gga aac ate aat atg aca aaa gag cct 
He Thr Leu Arg Lys Leu He Gly Asn He Asn Met Thr Lys Glu Pro 
370 375 380 
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gag caa caa tea ccg ctt gaa etc tgg ttc gaa cgt ate ata gat gtg 1200 
Glu Gin Gin Ser Pro Leu Glu Leu Trp Phe Glu Arg He He Asp Val 
385 390 395 

cct ctt gaa aag tta aca gtg gaa gat ctt tgc cgc get ate cga caa 1248 
Pro Leu Glu Lys Leu Thr Val Glu Asp Leu Cys Arg Ala He Arg Gin 
400 405 410 415 

aat tta tgt att gat cag ttg atg cca aga gtg ttg gaa gtt eta act 1296 
Asn Leu Cys He Asp Gin Leu Met Pro Arg Val Leu Glu Val Leu Thr 
420 425 430 

aaa gag ccg tta gcg ggt gaa tat tac gat ggt gaa eta att gca get 1344 
Lys Glu Pro Leu Ala Gly Glu Tyr Tyr Asp Gly Glu Leu He Ala Ala 
435 440 445 

tta tea acg ata aaa gga gaa gat eta aaa gat cag aaa agt ace ttt 1392 
Leu Ser Thr He Lys Gly Glu Asp Leu Lys Asp Gin Lys Ser Thr Phe 
450 455 460 

acc caa ata agg caa ctt ata aac cag eta gaa ccg tea gat att aac 1440 
Thr Gin He Arg Gin Leu He Asn Gin Leu Glu Pro Ser Asp He Asn 
465 470 475 

gat gat tta aga aaa gat ata tta aaa ate aat cag ata att gta 1485 

Asp Asp Leu Arg Lys Asp He Leu Lys He Asn Gin He He Val 
480 485 490 

taactaatcc cggccactga gecgagatet tctttgtgtg ccgggcatgt tcagcagctt 1545 

gggggtgaaa gtcccctgtc cagectg atg gtg gcg aag gcg ttc gcg tac gca 1599 
Met Val Ala Lys Ala Phe Ala Tyr Ala 
495 500 

ctt aac cag tgg ccg gca ctg acg tac tat gcg aac gat ggc tgg gtg 1647 
Leu Asn Gin Trp Pro Ala Leu Thr Tyr Tyr Ala Asn Asp Gly Trp Val 
505 510 515 

gaa ate gac aac aac ate get gaa aat gee ctg egg gcg gtc agt ctg 1695 
Glu He Asp Asn Asn He Ala Glu Asn Ala Leu Arg Ala Val Ser Leu 
520 525 530 535 

ggt cgt aaa aac ttc ctg ttc ttc ggc tct gac cat ggt ggt gag egg 1743 
Gly Arg Lys Asn Phe Leu Phe Phe Gly Ser Asp His Gly Gly Glu Arg 
540 545 550 

gga gcg eta ctg tac age ctg ate ggg acg tgc aaa ctg aat gac gtg 1791 
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Gly Ala Leu Leu Tyr Ser Leu lie Gly Thr Cys Lys Leu Asn Asp Val 
555 560 565 

gat cca gaa age tac ctt cgc cat gtg ctt gec gtc ata gca gac tgg 1839 
Asp Pro Glu Ser Tyr Leu Arg His Val Leu Ala Val lie Ala Asp Trp 
570 575 580 

ccg gtc aac egg gtc age gaa ctg ctt ccg tgg cgc ata gca ctg cca 1887 
Pro Val Asn Arg Val Ser Glu Leu Leu Pro Trp Arg lie Ala Leu Pro 
585 590 595 

get gaa taacacatcc ccgtcaatac ggccctcgct gtaegcttae agaaa atg ctg 1944 
Ala Glu Met Leu 

600 

atg tct gta cag aaa gaa aag aac gtc gca gag agt gtg gta tct gaa 1992 
Met Ser Val Gin Lys Glu Lys Asn Val Ala Glu Ser Val Val Ser Glu 
605 610 615 

acg cat acc ggc gac age gta tat get tec ctg ttt gaa aaa att aac 2040 
Thr His Thr Gly Asp Ser Val Tyr Ala Ser Leu Phe Glu Lys lie Asn 
620 625 630 635 

ctg aat ccg gta tct gee ctg agt gca ctg gat aac cct ttc egg tea 2088 
Leu Asn Pro Val Ser Ala Leu Ser Ala Leu Asp Asn Pro Phe Arg Ser 
640 645 650 

gca gat aac gcg act ggc aga att acc tec age ata caa cct gcg gtg 2136 
Ala Asp Asn Ala Thr Gly Arg lie Thr Ser Ser lie Gin Pro Ala Val 
655 660 665 

cag tgc gca get get gca gca act gag ggt tct tgt ccc egg caa tec 2184 
Gin Cys Ala Ala Ala Ala Ala Thr Glu Gly Ser Cys Pro Arg Gin Ser 
670 675 680 

ccg tgt tea gga a atg gtg gat aac tgg cag aag agt gta agg agt cgt 2233 
Pro Cys Ser Gly Met Val Asp Asn Trp Gin Lys Ser Val Arg Ser Arg 
685 690 695 

gcg etc ccg gaa gag gcg atg acg ggc tgg aac gaa ggc atg ate cgc 2281 
Ala Leu Pro Glu Glu Ala Met Thr Gly Trp Asn Glu Gly Met lie Arg 
700 705 710 715 

tta cag cag ttg get gag cgc ctg aac cgt cag gat gaa cag egg gga 2329 
Leu Gin Gin Leu Ala Glu Arg Leu Asn Arg Gin Asp Glu Gin Arg Gly 
720 725 730 

aaa tac atg acg gtc agt gaa ctg aaa acg gag gtg ttt ggc ate atg 2377 
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Lys Tyr Met Thr Val Ser Glu Leu Lys Thr Glu Val Phe Gly lie Met 
735 740 745 

cag get ttt aac egg cat ate ccg gcg gaa gag cag tta cgt cgc tac 2425 
Gin Ala Phe Asn Arg His lie Pro Ala Glu Glu Gin Leu Arg Arg Tyr 
750 755 760 

ggt gaa gtc cgt aac cag aat ggc agt gaa cag cag caa aaa cag get 2473 
Gly Glu Val Arg Asn Gin Asn Gly Ser Glu Gin Gin Gin Lys Gin Ala 
765 770 775 

gaa atg gcg eta aat cag tta att aac cgt tat cag atg ata cgt gca 2521 
Glu Met Ala Leu Asn Gin Leu He Asn Arg Tyr Gin Met He Arg Ala 
780 785 790 795 

ggc aaa caa tagtggtagc cataatgeag gagcaaagee tgaatcagga 2570 
Gly Lys Gin 

agagttattc tgactgagtt tggttttctg gcgattcttg tg atg gtg gga tgt 2624 

Met Val Gly Cys 
800 

get tgg tta get gaa cag gee ttt tec gac cat gcg ctt tea cca cac 2672 
Ala Trp Leu Ala Glu Gin Ala Phe Ser Asp His Ala Leu Ser Pro His 
805 810 815 

agt get tgg ccg tac agt gca teg cgc gat gee ggg ctg gee gat acg 2720 
Ser Ala Trp Pro Tyr Ser Ala Ser Arg Asp Ala Gly Leu Ala Asp Thr 
820 825 830 

ggc gcg ggc ggc tat ccc act tgt aaa cag egg tgg gee gac gac acc 2768 
Gly Ala Gly Gly Tyr Pro Thr Cys Lys Gin Arg Trp Ala Asp Asp Thr 
835 840 845 850 

gtt ggg ctg aaa gec cgt eta ctg caa ctt cct gec eta gat ate tgg 2816 
Val Gly Leu Lys Ala Arg Leu Leu Gin Leu Pro Ala Leu Asp He Trp 
855 860 865 

acg gcg ttt aaa aaa ate gac cag teg cag gta gtg tat gaa gag gee 2864 
Thr Ala Phe Lys Lys He Asp Gin Ser Gin Val Val Tyr Glu Glu Ala 
870 875 880 

gtg ctg cgc teg egg gtc agt gaa cga aat atg cag gta teg cag aat 2912 
Val Leu Arg Ser Arg Val Ser Glu Arg Asn Met Gin Val Ser Gin Asn 
885 890 895 

ggg cgc gtt tat cca age tat ggc ggt aac gtt gat ggc acc gtc gee 2960 
Gly Arg Val Tyr Pro Ser Tyr Gly Gly Asn Val Asp Gly Thr Val Ala 
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900 905 910 

aat gcc gcc acc egg ttg gca tec ggc get aga aat ate etc ggc age 3008 
Asn Ala Ala Thr Arg Leu Ala Ser Gly Ala Arg Asn lie Leu Gly Ser 
915 920 925 930 

ata gcg gca tgt acg gca ttc gac age gtg cgt taggcactac eg atg gta 3059 
lie Ala Ala Cys Thr Ala Phe Asp Ser Val Arg Met Val 

935 940 

cag gcg cag ctg caa ata gcg ctg gtg ate tgt att ccg ctg ata acg 3107 
Gin Ala Gin Leu Gin lie Ala Leu Val He Cys He Pro Leu He Thr 
945 950 955 

etc tgt teg gcg tgg gat gtg aaa gta gtg atg acg ctg acg ttt gtg 3155 
Leu Cys Ser Ala Trp Asp Val Lys Val Val Met Thr Leu Thr Phe Val 
960 965 970 975 

cag ttt gca eta ttt ttc etc acc ttt tgg tgg gaa ctg gca egg tgg 3203 
Gin Phe Ala Leu Phe Phe Leu Thr Phe Trp Trp Glu Leu Ala Arg Trp 
980 985 990 

ctt gat age tgg ctg ctg gat gtg etc tac aac age gat acc cac agt 3251 
Leu Asp Ser Trp Leu Leu Asp Val Leu Tyr Asn Ser Asp Thr His Ser 
995 1000 1005 

age tgg aat tta gcc ggg ate cag aat acg cag gat gac gtg att ate 3299 
Ser Trp Asn Leu Ala Gly He Gin Asn Thr Gin Asp Asp Val He He 
1010 1015 1020 

aat ctg gtg atg agg ttg atg ttt ctg gtg ttg ccg aca ttc tgg ctg 3347 
Asn Leu Val Met Arg Leu Met Phe Leu Val Leu Pro Thr Phe Trp Leu 
1025 1030 1035 

ggg gcg atg acg tgg get gga gtg agg gtt ggc gtg gcg ctg aat gga 3395 
Gly Ala Met Thr Trp Ala Gly Val Arg Val Gly Val Ala Leu Asn Gly 
1040 1045 1050 1055 

gcg ctg gcg gga tgattgggag gtgattcgee aatctcactt tcctatacac 3447 
Ala Leu Ala Gly 

atataaaatg ta atg aaa tat etc ttt ttt gag aat ata cat tct ata ttt 3498 

Met Lys Tyr Leu Phe Phe Glu Asn He His Ser He Phe 
1060 1065 1070 

tta aca ttc agt etc ttc cga aca tct gtg teg cct gat ttc cca atg 3546 
Leu Thr Phe Ser Leu Phe Arg Thr Ser Val Ser Pro Asp Phe Pro Met 
1075 1080 1085 
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att ttt gca ttg ccc tea ate att tta ggt caa ttt acg acc aac caa 3594 
lie Phe Ala Leu Pro Ser lie lie Leu Gly Gin Phe Thr Thr Asn Gin 
1090 1095 1100 



tta act aac ttt gtg ata tgt atg ggt aac acc gtt gaa cgt egg ctg 
Leu Thr Asn Phe Val He Cys Met Gly Asn Thr Val Glu Arg Arg Leu 
1105 1110 1115 1120 

ggt gtt gtt cat aat ccc ttt aaa agg tct ggg gat ggc cat gac etc 
Gly Val Val His Asn Pro Phe Lys Arg Ser Gly Asp Gly His Asp Leu 
1125 1130 1135 



agg gcg gta gcg tgaccaaagt tcatatccat accaattatt tttatttaaa 3742 
Arg Ala Val Ala 
1140 



atatcaactt attcgagttg ttttatttag ttcaaagaag gtatcaaa ttg ata gtt 3799 

Leu He Val 



ata gat ttt ttt tgt ggc tgt ggt gga gec agt gaa ggg eta cgt cag 3847 
He Asp Phe Phe Cys Gly Cys Gly Gly Ala Ser Glu Gly Leu Arg Gin 
1145 1150 1155 



get ggc ttt gat ate gag ctt gga tta gat att gac caa caa gca tea 3895 
Ala Gly Phe Asp He Glu Leu Gly Leu Asp He Asp Gin Gin Ala Ser 
1160 1165 1170 1175 



gaa aca ttt aaa get aat ttc ect gat gca aaa ttc ate caa gat gat 3943 
Glu Thr Phe Lys Ala Asn Phe Pro Asp Ala Lys Phe He Gin Asp Asp 
1180 1185 1190 



att agg aaa ate gaa cct caa gat ate tec gac ate att gat att aaa 3991 
He Arg Lys He Glu Pro Gin Asp He Ser Asp He He Asp He Lys 
1195 1200 1205 



get aaa egg cct ttg tta ctg agt gca tgt gca cca tgt caa cca ttt 4039 
Ala Lys Arg Pro Leu Leu Leu Ser Ala Cys Ala Pro Cys Gin Pro Phe 
1210 1215 1220 



teg caa cag aat aaa aat aaa act agt gac gac tea agg aga aat eta 4087 
Ser Gin Gin Asn Lys Asn Lys Thr Ser Asp Asp Ser Arg Arg Asn Leu 
1225 1230 1235 

eta aat gaa act cat cgt ttt att aga gaa ctt ctt cct gaa tat att 4135 
Leu Asn Glu Thr His Arg Phe He Arg Glu Leu Leu Pro Glu Tyr He 
1240 1245 1250 1255 
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atg ctt gaa aat gtt cct gga atg caa aaa att gat gaa gaa aaa gaa 
Met Leu Glu Asn Val Pro Gly Met Gin Lys He Asp Glu Glu Lys Glu 
1260 1265 1270 

ggc cca ttt cag gag ttt att aag eta ctt aaa gag tta gag tat aac 
Gly Pro Phe Gin Glu Phe He Lys Leu Leu Lys Glu Leu Glu Tyr Asn 
1275 1280 1285 

tat ata tct ttt ata gec aat get gag aac tat ggg att ccc caa aga 
Tyr He Ser Phe He Ala Asn Ala Glu Asn Tyr Gly He Pro Gin Arg 
1290 1295 1300 

aga aaa aga etc gtg etc tta get agt cga gta ggt aaa gtt ace eta 
Arg Lys Arg Leu Val Leu Leu Ala Ser Arg Val Gly Lys Val Thr Leu 
1305 1310 1315 

cca gag ata acc cat ggt aaa aat aaa ate cca ttc aaa act gta cga 
Pro Glu He Thr His Gly Lys Asn Lys He Pro Phe Lys Thr Val Arg 
1320 1325 1330 1335 

gat tat ate cag gac ttc aca aag tta tgt tea gga gaa acc gac ccc 
Asp Tyr He Gin Asp Phe Thr Lys Leu Cys Ser Gly Glu Thr Asp Pro 
1340 1345 1350 

aaa gat cct tta cat agg get gga aca ctg age cct ctt aac eta aaa 
Lys Asp Pro Leu His Arg Ala Gly Thr Leu Ser Pro Leu Asn Leu Lys 
1355 1360 1365 

aga att atg cac act cca gaa gga ggg gat aga aga aat tgg cca gaa 
Arg He Met His Thr Pro Glu Gly Gly Asp Arg Arg Asn Trp Pro Glu 
1370 1375 1380 

gag tta gtt aat aaa tgc cat aaa aat tat gat ggc cac aca gat act 
Glu Leu Val Asn Lys Cys His Lys Asn Tyr Asp Gly His Thr Asp Thr 
1385 1390 1395 

tat gga aga atg agt tgg gat aag cct gcg cct aca ctt acg acg aaa 
Tyr Gly Arg Met Ser Trp Asp Lys Pro Ala Pro Thr Leu Thr Thr Lys 
1400 1405 1410 1415 

tgt aat agt tac tec aat ggt cgt ttt ggg cat cct gac ccc act caa 
cys Asn Ser Tyr Ser Asn Gly Arg Phe Gly His Pro Asp Pro Thr Gin 
1420 1425 1430 

cat aga gca att age ata aga gaa gca tea aga tta caa aca ttt cct 
His Arg Ala He Ser He Arg Glu Ala Ser Arg Leu Gin Thr Phe Pro 
1435 1440 1445 
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tta age tat gtt ttt aaa ggt teg ctg aat tea atg gca aag caa ate 4759 
Leu Ser Tyr Val Phe Lys Gly Ser Leu Asn Ser Met Ala Lys Gin lie 
1450 1455 1460 

ggc aat get gta cct tgc gaa etc get aga eta ttt ggg eta cat etc 4807 
Gly Asn Ala Val Pro Cys Glu Leu Ala Arg Leu Phe Gly Leu His Leu 
1465 1470 1475 

ata gaa aat tgt act aat aag gat tea tagatatatg gctaaaataa 4854 
lie Glu Asn Cys Thr Asn Lys Asp Ser 
1480 1485 

gaacaaaggc tcgagctttg gac atg ctt ggc aga caa caa att gca ggt ata 4907 
Met Leu Gly Arg Gin Gin lie Ala Gly lie 
1490 1495 

cct act gee ttg agt gag tta ttt aaa aat get cat gat gec tat get 4955 
Pro Thr Ala Leu Ser Glu Leu Phe Lys Asn Ala His Asp Ala Tyr Ala 
1500 1505 1510 

gat aat gtc gaa gtt gat ttt ttt agg aaa gaa aat ctt ctt ate ttg 5003 
Asp Asn Val Glu Val Asp Phe Phe Arg Lys Glu Asn Leu Leu lie Leu 
1515 1520 1525 1530 

aga gat gat gga tta ggt atg aca acc gat gaa ttt gaa gag agg tgg 5051 
Arg Asp Asp Gly Leu Gly Met Thr Thr Asp Glu Phe Glu Glu Arg Trp 
1535 1540 1545 

ttg act att gga acc tec age aaa tta ate gac gat gat gca att aat 5099 
Leu Thr lie Gly Thr Ser Ser Lys Leu lie Asp Asp Asp Ala lie Asn 



1550 1555 

aaa cca gca gtg gat agt aat aaa gec 
Lys Pro Ala Val Asp Ser Asn Lys Ala 
1565 1570 

aaa gga ata ggc cgt tta tct 
Lys Gly lie Gly Arg Leu Ser 
1580 1585 

gtt ctt act agg gec aaa aga gac aat 
Val Leu Thr Arg Ala Lys Arg Asp Asn 
1595 1600 



1560 

ttt cgc cct ate atg gga gag 5147 
Phe Arg Pro lie Met Gly Glu 
1575 



gag ctt aag cca tta gtt get 5243 
Glu Leu Lys Pro Leu Val Ala 
1605 1610 



ate gca gca att gga cca cag gtg ctg 5195 
lie Ala Ala lie Gly Pro Gin Val Leu 
1590 



gca ttt gtt aat tgg agt tta ttt get ata cca tea ctt gat ctt gat 5291 
Ala Phe Val Asn Trp Ser Leu Phe Ala lie Pro Ser Leu Asp Leu Asp 
1615 1620 1625 
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gat ata gaa ata cca att aga act att ate aac gac gaa tgc ttc act 
Asp He Glu He Pro He Arg Thr He He Asn Asp Glu Cys Phe Thr 
1630 1635 1640 

aaa aaa act ctt gat gag atg att gag caa gca aga aat aat tta gac 
Lys Lys Thr Leu Asp Glu Met He Glu Gin Ala Arg Asn Asn Leu Asp 
1645 1650 1655 

tct tta tea cac aaa ata tea aaa tea aaa gta tea caa ata aat aca 
Ser Leu Ser His Lys He Ser Lys Ser Lys Val Ser Gin He Asn Thr 
1660 1665 1670 

caa tta tea tct ttt gaa ttt gat cct att eta tgg gaa aaa aaa tta 
Gin Leu Ser Ser Phe Glu Phe Asp Pro He Leu Trp Glu Lys Lys Leu 
1675 1680 1685 1690 

ggt ggg eta aga eta tct gga gat ggg cat gga act cac ttc ata ata 
Gly Gly Leu Arg Leu Ser Gly Asp Gly His Gly Thr His Phe He He 
1695 1700 1705 

atg cct ace gaa gaa ata tta ata gat gac att tec acg age gat age 
Met Pro Thr Glu Glu He Leu He Asp Asp He Ser Thr Ser Asp Ser 
1710 1715 1720 

aat aaa aca tea gag cag tct tct cgc tta gaa aaa get tta tta ggt 
Asn Lys Thr Ser Glu Gin Ser Ser Arg Leu Glu Lys Ala Leu Leu Gly 
1725 1730 1735 

ttt aca aac aca atg tac agt gat tea aac cct cct att ata get cgt 
Phe Thr Asn Thr Met Tyr Ser Asp Ser Asn Pro Pro He He Ala Arg 
1740 1745 1750 

ttt aga gac tat ctg gaa gat ggt gag tgc att gac aga att age gaa 
Phe Arg Asp Tyr Leu Glu Asp Gly Glu Cys He Asp Arg He Ser Glu 
1755 1760 1765 1770 

tea att ttt ttt aca ccg caa gaa ttc aat ctt gca gat cac cac att 
Ser lie Phe Phe Thr Pro Gin Glu Phe Asn Leu Ala Asp His His He 
1775 1780 1785 

gaa gga tgg ttc aat gaa ttt ggt caa ttc agt gga act gtt tct gtt 
Glu Gly Trp Phe Asn Glu Phe Gly Gin Phe Ser Gly Thr Val Ser Val 
1790 1795 1800 

tat ggt gaa gag cca att cat cat gtc gtg act tgg aaa aat aat aat 
Tyr Gly Glu Glu Pro He His His Val Val Thr Trp Lys Asn Asn Asn 
1805 1810 1815 
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caa tta acc caa tgc ggt cca ttt aaa ata aaa tta gcg tat att cat 5915 
Gin Leu Thr Gin Cys Gly Pro Phe Lys lie Lys Leu Ala Tyr He His 
1820 1825 1830 

ggt egg ctt cgt gat tea cgc tta ccc atg gag ttg tgg gec cct ctg 5963 
Gly Arg Leu Arg Asp Ser Arg Leu Pro Met Glu Leu Trp Ala Pro Leu 
1835 1840 1845 1850 

aag gag aaa aca gat aga tat ggt ggt tta tat ate tat cga gat gga 6011 
Lys Glu Lys Thr Asp Arg Tyr Gly Gly Leu Tyr He Tyr Arg Asp Gly 
1855 1860 1865 

tta aga att ttg ccc tat gga gat tea gat acg gat ttt eta aaa ata 6059 
Leu Arg He Leu Pro Tyr Gly Asp Ser Asp Thr Asp Phe Leu Lys He 
1870 1875 1880 

gaa aag aga aga acg tta tec get tct gaa tat ttt ttc tea tat cga 6107 
Glu Lys Arg Arg Thr Leu Ser Ala Ser Glu Tyr Phe Phe Ser Tyr Arg 
1885 1890 1895 

cgt ttg ttt gga gca ata gaa tta aca aaa gaa aac aat get tea tta 6155 
Arg Leu Phe Gly Ala He Glu Leu Thr Lys Glu Asn Asn Ala Ser Leu 
1900 1905 1910 

gtt gaa aaa get ggg cga gaa gga ttc att gaa aat aag cca tat aaa 6203 
Val Glu Lys Ala Gly Arg Glu Gly Phe He Glu Asn Lys Pro Tyr Lys 
1915 1920 1925 1930 

cag ttt aaa gaa atg ctt gaa aat ttc ttc ate gaa ate gca aga gat 6251 
Gin Phe Lys Glu Met Leu Glu Asn Phe Phe He Glu He Ala Arg Asp 
1935 1940 1945 

ttc ttt aag gac gat ggc gat atg tct gaa tta ttt gtt gag aca aag 6299 
Phe Phe Lys Asp Asp Gly Asp Met Ser Glu Leu Phe Val Glu Thr Lys 
1950 1955 I960 

caa cgt aga aat gaa gaa cat gat ttg tta tct aaa aga tct aaa caa 6347 
Gin Arg Arg Asn Glu Glu His Asp Leu Leu Ser Lys Arg Ser Lys Gin 
1965 1970 1975 

act aaa get aaa aaa gat aga tta aag aaa gat ctg tat gat ttt ttt 6395 
Thr Lys Ala Lys Lys Asp Arg Leu Lys Lys Asp Leu Tyr Asp Phe Phe 
1980 1985 1990 

gat aag tta gat aat gat tac tgg aat att gaa ata aat aag eta ate 6443 
Asp Lys Leu Asp Asn Asp Tyr Trp Asn He Glu He Asn Lys Leu He 

1995 2000 2005 2010 
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aat aaa aac gag gaa tat ttc tec agt aca gaa ata aca gac acc aat 
Asn Lys Asn Glu Glu Tyr Phe Ser Ser Thr Glu He Thr Asp Thr Asn 
2015 2020 2025 

ata gat tat gta tac aat aaa att aaa gaa caa aat gat get ate att 
He Asp Tyr Val Tyr Asn Lys lie Lys Glu Gin Asn Asp Ala lie He 
2030 2035 2040 

aaa aat eta cgt aat tct gtg gat ata aag aaa ccc tct gga gtt gga 
Lys Asn Leu Arg Asn Ser Val Asp lie Lys Lys Pro Ser Gly Val Gly 
2045 2050 2055 

tta aca aaa gag tta tct aat tta tgg gat aga tat caa ata gaa aga 
Leu Thr Lys Glu Leu Ser Asn Leu Trp Asp Arg Tyr Gin He Glu Arg 
2060 2065 2070 

caa aaa ata ctg tta tea eta aat gag eta aaa gat aac gtt gat aga 
Gin Lys He Leu Leu Ser Leu Asn Glu Leu Lys Asp Asn Val Asp Arg 
2075 2080 2085 2090 

aag ctt ata gaa ctg gat aat aaa aat aat gat ttt etc aac tta egg 
Lys Leu He Glu Leu Asp Asn Lys Asn Asn Asp Phe Leu Asn Leu Arg 
2095 2100 2105 

aag aga ctt gaa gat tct ttg aat eta caa caa agt tac tat gaa aaa 
Lys Arg Leu Glu Asp Ser Leu Asn Leu Gin Gin Ser Tyr Tyr Glu Lys 
2110 2115 2120 

gaa eta aca aag tta tat aat gac get aaa aat get ttg aaa gat gtg 
Glu Leu Thr Lys Leu Tyr Asn Asp Ala Lys Asn Ala Leu Lys Asp Val 
2125 2130 2135 

caa tct aaa gca aat agg tta att tct gat aat aag aaa aaa cat aag 
Gin Ser Lys Ala Asn Arg Leu He Ser Asp Asn Lys Lys Lys His Lys 
2140 2145 2150 

agt gaa eta aaa aac att tct tat gaa ttc caa tea act aat etc aat 
Ser Glu Leu Lys Asn He Ser Tyr Glu Phe Gin Ser Thr Asn Leu Asn 
2155 2160 2165 2170 

ggc aaa gat act gcg tat ata ttg gat gta aaa aga aat eta gaa agt 
Gly Lys Asp Thr Ala Tyr He Leu Asp Val Lys Arg Asn Leu Glu Ser 
2175 2180 2185 

aaa att gag aat act tea aac gaa gtg att aat gaa ata aga aaa eta 
Lys He Glu Asn Thr Ser Asn Glu Val He Asn Glu He Arg Lys Leu 
2190 2195 2200 
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acc gac cag att gca ata att agt gat agt acc act tct gaa aat tta 
Thr Asp Gin lie Ala lie He Ser Asp Ser Thr Thr Ser Glu Asn Leu 
2205 2210 2215 

tea teg get caa gta act gaa gca ate gaa act gaa ctt gaa cat tta 
Ser Ser Ala Gin Val Thr Glu Ala lie Glu Thr Glu Leu Glu His Leu 
2220 2225 2230 

cga gac caa caa gca aat aac gca gag tta ata eta ctt ggc atg get 
Arg Asp Gin Gin Ala Asn Asn Ala Glu Leu He Leu Leu Gly Met Ala 
2235 2240 2245 2250 

ctt tct gta gta cat cat gaa ttt aat ggt aat att agg gca att aga 
Leu Ser Val Val His His Glu Phe Asn Gly Asn He Arg Ala He Arg 
2255 2260 2265 

agt gcg eta agg gaa tta aaa gca tgg get gac aga aat cct aag ctt 
Ser Ala Leu Arg Glu Leu Lys Ala Trp Ala Asp Arg Asn Pro Lys Leu 
2270 2275 2280 

gat att ata tac caa aaa ate aga act agt ttt gat cac tta gat ggt 
Asp He He Tyr Gin Lys He Arg Thr Ser Phe Asp His Leu Asp Gly 
2285 2290 2295 

tat tta aaa acc ttt aca cca ttg aca aga cgt tta agt cgc tct aaa 
Tyr Leu Lys Thr Phe Thr Pro Leu Thr Arg Arg Leu Ser Arg Ser Lys 
2300 2305 2310 

acc aat ata act gga act gee att tta gaa ttt ate aga gat gta ttc 
Thr Asn He Thr Gly Thr Ala He Leu Glu Phe He Arg Asp Val Phe 
2315 2320 2325 2330 

gat gat cgt ctt gag aaa gaa gga att gaa tta ttc act acc tea aag 
Asp Asp Arg Leu Glu Lys Glu Gly He Glu Leu Phe Thr Thr Ser Lys 
2335 2340 2345 

ttt gtt aat caa gaa att gta act tac aca tea acc att tac cct gtc 
Phe Val Asn Gin Glu He Val Thr Tyr Thr Ser Thr He Tyr Pro Val 
2350 2355 2360 

ttt ata aat eta att gat aac gca ata tac tgg ctt ggg aaa aca act 
Phe He Asn Leu He Asp Asn Ala He Tyr Trp Leu Gly Lys Thr Thr 
2365 2370 2375 

gga gaa aaa aga ctt ata ctt gat get act gaa aca gga ttt gtt att 
Gly Glu Lys Arg Leu He Leu Asp Ala Thr Glu Thr Gly Phe Val He 
2380 2385 2390 
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ggt gat act ggt ccc ggt gtt tea act aga gat cga gat ata ata ttt 
Gly Asp Thr Gly Pro Gly Val Ser Thr Arg Asp Arg Asp lie He Phe 
2395 2400 2405 2410 

gat atg gga ttt aca cga aaa aca gga ggg cgt gga atg gga tta ttc 
Asp Met Gly Phe Thr Arg Lys Thr Gly Gly Arg Gly Met Gly Leu Phe 
2415 2420 2425 

att tec aaa gag tgt tta tct cga gat gga ttt act ata aga ttg gat 
He Ser Lys Glu Cys Leu Ser Arg Asp Gly Phe Thr He Arg Leu Asp 
2430 2435 2440 

gat tac act cct gaa cag ggt get ttc ttt att att gag cca tea gaa 
Asp Tyr Thr Pro Glu Gin Gly Ala Phe Phe He He Glu Pro Ser Glu 
2445 2450 2455 

gaa aca agt gaa tag eggatataaa taa atg aca age tct act gat ttt 
Glu Thr Ser Glu Met Thr Ser Ser Thr Asp Phe 

2460 2465 2470 

cat aaa ctt tct gaa gac tgc gtt cgc cgt ttt tta cat tct gta gtt 
His Lys Leu Ser Glu Asp Cys Val Arg Arg Phe Leu His Ser Val Val 
2475 2480 2485 

get gta gat gac aat atg tct ttt gga get ggt agt gat act ttc cct 
Ala Val Asp Asp Asn Met Ser Phe Gly Ala Gly Ser Asp Thr Phe Pro 
2490 2495 2500 

aca gac gaa gat att aat get tta gtt gat ccc gac gat gat cct aca 
Thr Asp Glu Asp He Asn Ala Leu Val Asp Pro Asp Asp Asp Pro Thr 
2505 2510 2515 

cca ata ata aca gca tea gca tec cca agg ata gaa tea act aaa tea 
Pro He He Thr Ala Ser Ala Ser Pro Arg He Glu Ser Thr Lys Ser 
2520 2525 2530 

aaa gca aag gta aaa aac cat cct ttt gat tac caa get eta gca gaa 
Lys Ala Lys Val Lys Asn His Pro Phe Asp Tyr Gin Ala Leu Ala Glu 
2535 2540 2545 2550 

get ttc gec aaa gat ggt att get tgt tgc gga tta tta get aag agt 
Ala Phe Ala Lys Asp Gly He Ala Cys Cys Gly Leu Leu Ala Lys Ser 
2S55 2560 2565 

ttt aat gtt gaa gaa aga gat ata att aca gca tea tec cac aag gca 
Phe Asn Val Glu Glu Arg Asp He He Thr Ala Ser Ser His Lys Ala 
2570 2575 2580 
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gat ata aca ata ctt gac tgg gat atg caa age gat agt ggg caa ttt 8220 
Asp lie Thr lie Leu Asp Trp Asp Met Gin Ser Asp Ser Gly Gin Phe 
2585 2590 2595 

get att gaa ata ata aaa teg ata ate gtt tea gat ata aat tct gga 8268 
Ala He Glu He He Lys Ser He He Val Ser Asp He Asn Ser Gly 
2600 2605 2610 

gga cgt tta cgt ctt ctt tct att tat act ggt gaa cat gtt act get 8316 
Gly Arg Leu Arg Leu Leu Ser He Tyr Thr Gly Glu His Val Thr Ala 
2615 2620 2625 2630 

gtt ata act aag ttg aac aat gag tta aag aaa aca tac cgt age gta 8364 
Val He Thr Lys Leu Asn Asn Glu Leu Lys Lys Thr Tyr Arg Ser Val 
2635 2640 2645 

ata aaa aat gat gat agt att ttt att gaa gat aac tat gca etc gaa 8412 
He Lys Asn Asp Asp Ser He Phe He Glu Asp Asn Tyr Ala Leu Glu 
2650 2655 2660 

caa tgg tgt ata gtt gtt att agt aaa gac gtt tat gaa aaa gat ctt 8460 
Gin Trp Cys He Val Val He Ser Lys Asp Val Tyr Glu Lys Asp Leu 
2665 2670 2675 

cca aat gtg tta ata aaa aaa ttc act aac ctt aca get ggg ttg eta 8508 
Pro Asn Val Leu He Lys Lys Phe Thr Asn Leu Thr Ala Gly Leu Leu 
2680 2685 2690 

tec aac gec gca etc tct tge att tct gaa ata aga gaa aaa acc cat 8556 
Ser Asn Ala Ala Leu Ser Cys He Ser Glu He Arg Glu Lys Thr His 
2695 2700 2705 2710 

ggg ata tta aca aaa tat aat aat aaa tta gac act gca tat gtt tec 8 604 
Gly He Leu Thr Lys Tyr Asn Asn Lys Leu Asp Thr Ala Tyr Val Ser 
2715 2720 2725 

cac ate tta aat tta ata aaa tec aag gag tea agg gca tat get tat 8652 
His He Leu Asn Leu He Lys Ser Lys Glu Ser Arg Ala Tyr Ala Tyr 
2730 2735 2740 

gaa aat get cat gat tat gca gta gat tta att tct gaa gaa ata aga 8700 
Glu Asn Ala His Asp Tyr Ala Val Asp Leu He Ser Glu Glu He Arg 
2745 2750 2755 

tea ata ttg caa ata agt gaa aac tta aag aaa tct eta age aaa aac 8748 
Ser He Leu Gin He Ser Glu Asn Leu Lys Lys Ser Leu Ser Lys Asn 
2760 2765 2770 
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tec tta tec cat tgg cct att ttt cac tat gca aaa aat ggt tgt aag 
Ser Leu Ser His Trp Pro lie Phe His Tyr Ala Lys Asn Gly Cys Lys 
2775 2780 2785 2790 

aat ttt eta tta act gga aaa aaa caa aaa gac tta tea gta gaa cat 
Asn Phe Leu Leu Thr Gly Lys Lys Gin Lys Asp Leu Ser Val Glu His 
2795 2800 2805 

eta agg aat ata etc tct get gat tct tta gaa gaa att caa cac get 
Leu Arg Asn lie Leu Ser Ala Asp Ser Leu Glu Glu lie Gin His Ala 
2810 2815 2820 

att gaa cac gca tct tta ggt aaa aag gaa tac tta age caa gat ggt 
lie Glu His Ala Ser Leu Gly Lys Lys Glu Tyr Leu Ser Gin Asp Gly 
2825 2830 2835 

gaa gaa gat aaa aag tta atg caa tta tgc tct ctg gaa ate acg cgc 
Glu Glu Asp Lys Lys Leu Met Gin Leu Cys Ser Leu Glu lie Thr Arg 
2840 2845 2850 

agg agt tta aga tat cat tct cat ata gat aat gtg tec tta aaa caa 
Arg Ser Leu Arg Tyr His Ser His lie Asp Asn Val Ser Leu Lys Gin 
2855 2860 2865 2870 

gga act tta ctt tta gat gca tat aat ttt gtc tat eta tgc ata caa 
Gly Thr Leu Leu Leu Asp Ala Tyr Asn Phe Val Tyr Leu Cys lie Gin 
2875 2880 2885 

cca tta tgt gat age gtc aga ttg cat gaa aaa gec gat ttt tta ttc 
Pro Leu Cys Asp Ser Val Arg Leu His Glu Lys Ala Asp Phe Leu Phe 
2890 2895 2900 

etc agg gga aca ctg gac gat aat aat tac aat ttg tta ate gaa gat 
Leu Arg Gly Thr Leu Asp Asp Asn Asn Tyr Asn Leu Leu lie Glu Asp 
2905 2910 2915 

gaa tat ggc ggt ttt tat aaa att aaa atg ccg gca aaa get tct aat 
Glu Tyr Gly Gly Phe Tyr Lys He Lys Met Pro Ala Lys Ala Ser Asn 
2920 2925 2930 

att att tea ttt tea ttt gga gtc gaa aat gga aac ggt gtc ate ata 
He He Ser Phe Ser Phe Gly Val Glu Asn Gly Asn Gly Val He He 
2935 2940 2945 2950 

ggg aaa aag aac aat eta gtt aat act gac tat ate tea ttc gtt cct 
Gly Lys Lys Asn Asn Leu Val Asn Thr Asp Tyr He Ser Phe Val Pro 
2955 2960 2965 



79 



WO 00/28038 



PCT/GB99/0372I 



tta etc gtt gaa aaa ata tct act cca aaa gta ttg aaa tgg ate ggg 9372 
Leu Leu Val Glu Lys lie Ser Thr Pro Lys Val Leu Lys Trp lie Gly 
2970 2975 2980 

gaa ata aaa aca acg tac gcg caa aaa ata aca act gat att gtt get 9420 
Glu lie Lys Thr Thr Tyr Ala Gin Lys lie Thr Thr Asp lie Val Ala 
2985 2990 2995 

aat ctg tea aga ata ggt tta gat caa cat gag tgg tta cga ata aaa 9468 
Asn Leu Ser Arg lie Gly Leu Asp Gin His Glu Trp Leu Arg lie Lys 
3000 3005 3010 

tea aaa gat ata taaatgatta tatatgeegt cgttttataa aaactggcgg 9520 

Ser Lys Asp lie 

3015 

catgtatatc tagttagtcc atcatagaag tcaagaaatt tagtttgece tatatcttat 9580 

agaaaatata ttttatatgc ttaaaaaaca ccatctttct aagatggcat ttatgtgctt 9640 

tgtttcgatc aattacaact gatatattac catattgatt aattttatgt tatttaccaa 9700 

agtaaeggea tcttaatata tegtcataat atagtgcgcg ttctgactct aatactgaaa 9760 

aatttatttg ttctatttta cacttactgc aaatagcatc cagtttatca tatagtgtcg 9820 

catcaattgg cgcag atg tea tea cgc caa ate ctt gag cat tat aat get 9871 
Met Ser Ser Arg Gin lie Leu Glu His Tyr Asn Ala 
3020 3025 3030 

eta aca tat ccc eta cat caa tea ate ttg ttg cag ata atg act teg 9919 
Leu Thr Tyr Pro Leu His Gin Ser lie Leu Leu Gin lie Met Thr Ser 
3035 3040 3045 

aat ttg tta tea gtt tgc act gga aaa tec att tac gag gat ate tec 9967 
Asn Leu Leu Ser Val Cys Thr Gly Lys Ser lie Tyr Glu Asp lie Ser 
3050 3055 3060 

ggc agt tct tgg aat ate ata cac ttc aat ate cct etc ccc ate tct 10015 
Gly Ser Ser Trp Asn lie lie His Phe Asn lie Pro Leu Pro lie Ser 
3065 3070 3075 

aga gcg aga ctt tec ata ttt tct tat tgt gtc aga att aaa cct tgg 10063 
Arg Ala Arg Leu Ser lie Phe Ser Tyr Cys Val Arg lie Lys Pro Trp 
3080 3085 3090 

atg agt atg gat tac atg taaccggctc atttaaaccg tctggtctgt 10111 
Met Ser Met Asp Tyr Met 
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ttcctccggt tttacaaaaa ta 



atg tec ate att ttt aat gga cac tat cgt 
Met Ser lie lie Phe Asn Gly His Tyr Arg 
3105 3110 



atg aaa cac egg act tgg ate act gaa get tta cgt ctt cac ttt gaa 10211 
Met Lys His Arg Thr Trp lie Thr Glu Ala Leu Arg Leu His Phe Glu 
3115 3120 3125 

gaa cat tta ccc cag gtt gtg gtc ggg cgt cgc ctg ggc gta cca aaa 10259 
Glu His Leu Pro Gin Val Val Val Gly Arg Arg Leu Gly Val Pro Lys 
3130 3135 3140 

tea aca get tgt ggt atg ttc gtg cgc ttt cgc aaa get ggc ttt tea 10307 
Ser Thr Ala Cys Gly Met Phe Val Arg Phe Arg Lys Ala Gly Phe Ser 
3145 3150 3155 

tgg cct ctg ccc gca ggt atg teg gag egg gag ctt gat ggc cgt ctt 10355 
Trp Pro Leu Pro Ala Gly Met Ser Glu Arg Glu Leu Asp Gly Arg Leu 
3160 3165 3170 

tac ggg agt acc tec aca gta cct gtc gta ctt tgt agt gga teg gta 10403 
Tyr Gly Ser Thr Ser Thr Val Pro Val Val Leu Cys Ser Gly Ser Val 
3175 3180 3185 3190 

att cag gac acc teg aaa tec tgt taatgttaaa acagtgaaaa tgaggtgatg 10457 
lie Gin Asp Thr Ser Lys Ser Cys 
3195 

c atg ate aaa act cgt egg act aaa cgt acc ttt tec ccg gag ttc aag 10506 
Met lie Lys Thr Arg Arg Thr Lys Arg Thr Phe Ser Pro Glu Phe Lys 
3200 3205 3210 

ctt gaa get ttc gag cag gtg gtg gtt aaa tac cag cgt gat gtc aga 10554 
Leu Glu Ala Phe Glu Gin Val Val Val Lys Tyr Gin Arg Asp Val Arg 
3215 3220 3225 3230 

gaa gtc gcg cag gca etc gag etc aac cct gac cat ttg cgt aaa tgg 10602 
Glu Val Ala Gin Ala Leu Glu Leu Asn Pro Asp His Leu Arg Lys Trp 
3235 3240 3245 

ata egg ttg tat aag cag gaa ctt cag ggt att gag cca get ggt aat 10650 
lie Arg Leu Tyr Lys Gin Glu Leu Gin Gly lie Glu Pro Ala Gly Asn 
3250 3255 3260 



get att acc cct gaa caa cgc gaa att cag cag ctt aaa gcg cag ata 10698 
Ala He Thr Pro Glu Gin Arg Glu He Gin Gin Leu Lys Ala Gin He 
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3265 3270 3275 

aag cgc gtt gag atg gaa aaa gaa ata eta aag cag get gec gtg ctg 10746 
Lys Arg Val Glu Met Glu Lys Glu lie Leu Lys Gin Ala Ala Val Leu 
3280 3285 3290 

atg age gaa ate ccc ggg aag ctg teg cgc taatcacaca gctgaaagca 10796 
Met Ser Glu lie Pro Gly Lys Leu Ser Arg 
3295 3300 

aagtggccag tgtgggttat ttgtcattta tteggtatta acegtagegt ttattacgcg 10856 
caggtgaagc gtcctgttaa tgtgcaaaga attgaattac gaagccgggt gagggctttc 10916 
catgetetea gtcgtggcgc ageegggtag ccgggcaatc agtcagatgt tgegecagag 10976 
tggcgttgat geaggceggt ggctggcatg acgactgatg egggaatgag ggctgacaag 11036 
tcgacagccg gttaaacatc acaacegggt aaacgaagac aaaagtcege cattgecaaa 11096 
tttactgaac eggcaattte accccgccgc accaaactgc gtctggtgcg gcgacatcag 11156 
ttttattcg 11165 



<210> 35 
<211> 366 
<212> PRT 

<213> Escherichia coli 
<400> 35 

Ser Asp Met Gin Arg Gly lie Gin Ala Ala Thr Ala Ala Leu Gin Gly 
15 10 15 

Leu Val Gly Gly Asn Met Ala Gly Ala Leu Ala Gly Ala Ser Ala Pro 
20 25 30 

Glu Leu Ala Asn He He Gly His His Ala Gly He Asp Asp Asn Thr 
35 40 45 

Ala Ala Lys Ala lie Ala His Ala lie Leu Gly Gly Val Thr Ala Ala 
50 55 60 

Leu Gin Gly Asn Ser Ala Ala Ala Gly Ala lie Gly Ala Gly Thr Gly 
65 70 75 80 

Glu Val He Ala Ser Ala He Ala Lys Ser Leu Tyr Pro Gly Val Asp 
85 90 95 
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Pro Ser Lys Leu Thr Glu Asp Gin Lys Gin Thr Val Ser Thr Leu Ala 
100 105 110 

Thr Leu Ser Ala Gly Met Ala Gly Gly lie Ala Ser Gly Asp Val Ala 
115 120 125 

Gly Ala Ala Ala Gly Ala Gly Ala Gly Lys Asn Val Val Glu Asn Asn 
130 135 140 

Ala Leu Ser Leu Val Ala Arg Gly Cys Ala Val Ala Ala Pro Cys Arg 
145 150 155 160 

Thr Lys Val Ala Glu Gin Leu Leu Glu lie Gly Ala Lys Ala Gly Met 
165 170 175 

Ala Gly Leu Ala Gly Ala Ala Val Lys Asp Met Ala Asp Arg Met Thr 
180 185 190 

Ser Asp Glu Leu Glu His Leu He Thr Leu Gin Met Met Gly Asn Asp 
195 200 205 

Glu He Thr Thr Lys Tyr Leu Ser Ser Leu His Asp Lys Tyr Gly Ser 
210 215 220 

Gly Ala Ala Ser Asn Pro Asn He Gly Lys Asp Leu Thr Asp Ala Glu 
225 230 235 240 

Lys Val Glu Leu Gly Gly Ser Gly Ser Gly Thr Gly Thr Pro Pro Pro 
245 250 255 

Ser Glu Asn Asp Pro Lys Gin Gin Asn Glu Lys Thr Val Asp Lys Leu 
260 265 270 

Asn Gin Lys Gin Glu Ser Ala He Lys Lys He Asp Asn Thr He Lys 
275 280 285 

Asn Ala Leu Lys Asp His Asp He He Gly Thr Leu Lys Asp Met Asp 
290 295 300 

Gly Lys Pro Val Pro Lys Glu Asn Gly Gly Tyr Trp Asp His Met Gin 
305 310 315 320 

Glu Met Gin Asn Thr Leu Arg Gly Leu Arg Asn His Ala Asp Thr Leu 
325 330 335 

Lys Asn Val Asn Asn Pro Glu Ala Gin Ala Ala Tyr Gly Arg Ala Thr 
340 345 350 
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Asp Ala lie Asn Lys lie Glu Ser Ala Leu Lys Gly Tyr Gly 
355 360 365 



<210> 36 
<211> 128 
<212> PRT 

<213> Escherichia coli 
<400> 36 

Met lie Thr Leu Arg Lys Leu lie Gly Asn lie Asn Met Thr Lys Glu 
1 5 10 15 

Pro Glu Gin Gin Ser Pro Leu Glu Leu Trp Phe Glu Arg lie lie Asp 
20 25 30 

Val Pro Leu Glu Lys Leu Thr Val Glu Asp Leu Cys Arg Ala lie Arg 
35 40 45 

Gin Asn Leu cys lie Asp Gin Leu Met Pro Arg Val Leu Glu Val Leu 
50 55 60 

Thr Lys Glu Pro Leu Ala Gly Glu Tyr Tyr Asp Gly Glu Leu lie Ala 
65 70 75 80 

Ala Leu Ser Thr lie Lys Gly Glu Asp Leu Lys Asp Gin Lys Ser Thr 
85 90 95 

Phe Thr Gin He Arg Gin Leu He Asn Gin Leu Glu Pro Ser Asp lie 
100 105 110 

Asn Asp Asp Leu Arg Lys Asp He Leu Lys He Asn Gin He He Val 
115 120 125 



<210> 37 
<211> 107 
<212> PRT 

<213> Escherichia coli 
<400> 37 

Met Val Ala Lys Ala Phe Ala Tyr Ala Leu Asn Gin Trp Pro Ala Leu 
15 10 15 

Thr Tyr Tyr Ala Asn Asp Gly Trp Val Glu He Asp Asn Asn He Ala 
20 25 30 
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Glu Asn Ala Leu Arg Ala Val Ser 
35 40 

Phe Gly Ser Asp His Gly Gly Glu 
50 55 

lie Gly Thr Cys Lys Leu Asn Asp 
65 70 

His Val Leu Ala Val lie Ala Asp 
85 

Leu Leu Pro Trp Arg He Ala Leu 
100 



Leu Gly Arg Lys Asn Phe Leu Phe 
45 

Arg Gly Ala Leu Leu Tyr Ser Leu 
60 

Val Asp Pro Glu Ser Tyr Leu Arg 
75 80 

Trp Pro Val Asn Arg Val Ser Glu 
90 95 

Pro Ala Glu 
105 



<210> 38 
<211> 86 
<212> PRT 

<213> Escherichia coli 
<400> 38 

Met Leu Met Ser Val Gin Lys Glu Lys Asn Val Ala Glu Ser Val Val 
15 10 15 

Ser Glu Thr His Thr Gly Asp Ser Val Tyr Ala Ser Leu Phe Glu Lys 
20 25 30 

He Asn Leu Asn Pro Val Ser Ala Leu Ser Ala Leu Asp Asn Pro Phe 
35 40 45 

Arg Ser Ala Asp Asn Ala Thr Gly Arg He Thr Ser Ser He Gin Pro 
50 55 60 

Ala Val Gin Cys Ala Ala Ala Ala Ala Thr Glu Gly Ser Cys Pro Arg 
65 70 75 80 

Gin Ser Pro Cys Ser Gly 
85 



<210> 39 
<211> 111 
<212> PRT 

<213> Escherichia coli 
<400> 39 

Met Val Asp Asn Trp Gin Lys Ser Val Arg Ser Arg Ala Leu Pro Glu 
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1 5 

Glu Ala Met Thr Gly Trp Asn Glu 
20 

Ala Glu Arg Leu Asn Arg Gin Asp 
35 40 

Val Ser Glu Leu Lys Thr Glu Val 
50 55 

Arg His lie Pro Ala Glu Glu Gin 
65 70 



10 15 

Gly Met lie Arg Leu Gin Gin Leu 
25 30 

Glu Gin Arg Gly Lys Tyr Met Thr 
45 

Phe Gly lie Met Gin Ala Phe Asn 
60 

Leu Arg Arg Tyr Gly Glu Val Arg 
75 80 



Asn Gin Asn Gly Ser Glu Gin Gin Gin Lys Gin Ala Glu Met Ala Leu 

85 90 95 

Asn Gin Leu lie Asn Arg Tyr Gin Met lie Arg Ala Gly Lys Gin 

100 105 110 



<210> 40 
<211> 143 
<212> PRT 

<213> Escherichia coli 
<400> 40 

Met Val Gly Cys Ala Trp Leu Ala Glu Gin Ala Phe Ser Asp His Ala 



Leu Ser Pro His Ser Ala Trp Pro Tyr Ser Ala Ser Arg Asp Ala Gly 



Leu Ala Asp Thr Gly Ala Gly Gly Tyr Pro Thr Cys Lys Gin Arg Trp 
35 40 45 

Ala Asp Asp Thr Val Gly Leu Lys Ala Arg Leu Leu Gin Leu Pro Ala 



Leu Asp lie Trp Thr Ala Phe Lys Lys lie Asp Gin Ser Gin Val Val 
65 70 75 80 

Tyr Glu Glu Ala Val Leu Arg Ser Arg Val Ser Glu Arg Asn Met Gin 
85 90 95 

Val Ser Gin Asn Gly Arg Val Tyr Pro Ser Tyr Gly Gly Asn Val Asp 
100 105 110 
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Gly Thr Val Ala Asn Ala Ala Thr Arg Leu Ala Ser Gly Ala Arg Asn 
115 120 125 

He Leu Gly Ser He Ala Ala Cys Thr Ala Phe Asp Ser Val Arg 
130 135 140 



<210> 41 
<211> 118 
<212> PRT 

<213> Escherichia coli 
<400> 41 

Met Val Gin Ala Gin Leu Gin He Ala Leu Val He Cys He Pro Leu 
15 10 15 

He Thr Leu Cys Ser Ala Trp Asp Val Lys Val Val Met Thr Leu Thr 
20 25 30 

Phe Val Gin Phe Ala Leu Phe Phe Leu Thr Phe Trp Trp Glu Leu Ala 
35 40 45 

Arg Trp Leu Asp Ser Trp Leu Leu Asp Val Leu Tyr Asn Ser Asp Thr 
50 55 60 

His Ser Ser Trp Asn Leu Ala Gly He Gin Asn Thr Gin Asp Asp Val 
65 70 75 80 

He He Asn Leu Val Met Arg Leu Met Phe Leu Val Leu Pro Thr Phe 
85 90 95 

Trp Leu Gly Ala Met Thr Trp Ala Gly Val Arg Val Gly Val Ala Leu 
100 105 110 

Asn Gly Ala Leu Ala Gly 
115 



<210> 42 
<211> 81 
<212> PRT 

<213> Escherichia coli 
<400> 42 

Met Lys Tyr Leu Phe Phe Glu Asn He His Ser He Phe Leu Thr Phe 
15 io 15 

Ser Leu Phe Arg Thr Ser Val Ser Pro Asp Phe Pro Met He Phe Ala 
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20 25 30 

Leu Pro Ser He He Leu Gly Gin Phe Thr Thr Asn Gin Leu Thr Asn 
35 40 45 

Phe Val He Cys Met Gly Asn Thr Val Glu Arg Arg Leu Gly Val Val 
50 55 60 

His Asn Pro Phe Lys Arg Ser Gly Asp Gly His Asp Leu Arg Ala Val 
65 70 75 80 

Ala 



<210> 43 
<211> 348 
<212> PRT 

<213> Escherichia coli 
<400> 43 

Leu He Val He Asp Phe Phe Cys Gly Cys Gly Gly Ala Ser Glu Gly 
15 io 15 

Leu Arg Gin Ala Gly Phe Asp He Glu Leu Gly Leu Asp He Asp Gin 
20 25 30 

Gin Ala Ser Glu Thr Phe Lys Ala Asn Phe Pro Asp Ala Lys Phe He 
35 40 45 

Gin Asp Asp He Arg Lys He Glu Pro Gin Asp He Ser Asp He He 
50 55 60 

Asp He Lys Ala Lys Arg Pro Leu Leu Leu Ser Ala Cys Ala Pro Cys 



Gin Pro Phe Ser Gin Gin Asn Lys Asn Lys Thr Ser Asp Asp Ser Arg 
85 90 95 

Arg Asn Leu Leu Asn Glu Thr His Arg Phe He Arg Glu Leu Leu Pro 
100 105 110 

Glu Tyr He Met Leu Glu Asn Val Pro Gly Met Gin Lys He Asp Glu 
115 120 125 

Glu Lys Glu Gly Pro Phe Gin Glu Phe He Lys Leu Leu Lys Glu Leu 
130 135 140 



Glu Tyr Asn Tyr He Ser Phe He Ala Asn Ala Glu Asn Tyr Gly He 
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145 150 155 160 

Pro Gin Arg Arg Lys Arg Leu Val Leu Leu Ala Ser Arg Val Gly Lys 
165 170 175 

Val Thr Leu Pro Glu lie Thr His Gly Lys Asn Lys He Pro Phe Lys 
180 185 190 

Thr Val Arg Asp Tyr He Gin Asp Phe Thr Lys Leu Cys Ser Gly Glu 
195 200 205 

Thr Asp Pro Lys Asp Pro Leu His Arg Ala Gly Thr Leu Ser Pro Leu 
210 215 220 

Asn Leu Lys Arg He Met His Thr Pro Glu Gly Gly Asp Arg Arg Asn 
225 230 235 240 

Trp Pro Glu Glu Leu Val Asn Lys Cys His Lys Asn Tyr Asp Gly His 
245 250 255 

Thr Asp Thr Tyr Gly Arg Met Ser Trp Asp Lys Pro Ala Pro Thr Leu 
260 265 270 

Thr Thr Lys Cys Asn Ser Tyr Ser Asn Gly Arg Phe Gly His Pro Asp 
275 280 285 

Pro Thr Gin His Arg Ala He Ser He Arg Glu Ala Ser Arg Leu Gin 
290 295 300 

Thr Phe Pro Leu Ser Tyr Val Phe Lys Gly Ser Leu Asn Ser Met Ala 
305 310 315 320 

Lys Gin He Gly Asn Ala Val Pro Cys Glu Leu Ala Arg Leu Phe Gly 
325 330 335 

Leu His Leu He Glu Asn Cys Thr Asn Lys Asp Ser 
340 345 



<210> 44 
<211> 974 
<212> PRT 

<213> Escherichia coli 
<400> 44 

Met Leu Gly Arg Gin Gin He Ala Gly He Pro Thr Ala Leu Ser Glu 
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Leu Phe Lys Asn Ala His Asp Ala Tyr Ala Asp Asn Val Glu Val Asp 
20 25 30 

Phe Phe Arg Lys Glu Asn Leu Leu lie Leu Arg Asp Asp Gly Leu Gly 
35 40 45 

Met Thr Thr Asp Glu Phe Glu Glu Arg Trp Leu Thr lie Gly Thr Ser 
50 55 60 

Ser Lys Leu lie Asp Asp Asp Ala lie Asn Lys Pro Ala Val Asp Ser 
65 70 75 80 

Asn Lys Ala Phe Arg Pro lie Met Gly Glu Lys Gly He Gly Arg Leu 
85 90 95 

Ser He Ala Ala lie Gly Pro Gin Val Leu Val Leu Thr Arg Ala Lys 
100 105 110 

Arg Asp Asn Glu Leu Lys Pro Leu Val Ala Ala Phe Val Asn Trp Ser 
115 120 125 

Leu Phe Ala He Pro Ser Leu Asp Leu Asp Asp He Glu He Pro He 
130 135 140 

Arg Thr He He Asn Asp Glu Cys Phe Thr Lys Lys Thr Leu Asp Glu 
145 150 155 160 

Met He Glu Gin Ala Arg Asn Asn Leu Asp Ser Leu Ser His Lys He 
165 170 175 

Ser Lys Ser Lys Val Ser Gin He Asn Thr Gin Leu Ser Ser Phe Glu 
180 185 190 

Phe Asp Pro He Leu Trp Glu Lys Lys Leu Gly Gly Leu Arg Leu Ser 
195 200 205 

Gly Asp Gly His Gly Thr His Phe He He Met Pro Thr Glu Glu He 
210 215 220 

Leu He Asp Asp He Ser Thr Ser Asp Ser Asn Lys Thr Ser Glu Gin 
225 230 235 240 

Ser Ser Arg Leu Glu Lys Ala Leu Leu Gly Phe Thr Asn Thr Met Tyr 
245 250 255 

Ser Asp Ser Asn Pro Pro He He Ala Arg Phe Arg Asp Tyr Leu Glu 
260 265 270 
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Asp Gly Glu Cys He Asp Arg He Ser Glu Ser He Phe Phe Thr Pro 
275 280 285 

Gin Glu Phe Asn Leu Ala Asp His His He Glu Gly Trp Phe Asn Glu 
290 295 300 

Phe Gly Gin Phe Ser Gly Thr Val Ser Val Tyr Gly Glu Glu Pro He 
305 310 315 320 

His His Val Val Thr Trp Lys Asn Asn Asn Gin Leu Thr Gin Cys Gly 
325 330 335 

Pro Phe Lys He Lys Leu Ala Tyr lie His Gly Arg Leu Arg Asp Ser 
340 345 350 

Arg Leu Pro Met Glu Leu Trp Ala Pro Leu Lys Glu Lys Thr Asp Arg 
355 360 365 

Tyr Gly Gly Leu Tyr He Tyr Arg Asp Gly Leu Arg He Leu Pro Tyr 
370 375 380 

Gly Asp Ser Asp Thr Asp Phe Leu Lys He Glu Lys Arg Arg Thr Leu 
385 390 395 400 

Ser Ala Ser Glu Tyr Phe Phe Ser Tyr Arg Arg Leu Phe Gly Ala He 
405 410 415 

Glu Leu Thr Lys Glu Asn Asn Ala Ser Leu Val Glu Lys Ala Gly Arg 
420 425 430 

Glu Gly Phe He Glu Asn Lys Pro Tyr Lys Gin Phe Lys Glu Met Leu 
435 440 445 

Glu Asn Phe Phe He Glu He Ala Arg Asp Phe Phe Lys Asp Asp Gly 
450 455 460 

Asp Met Ser Glu Leu Phe Val Glu Thr Lys Gin Arg Arg Asn Glu Glu 
465 470 475 480 

His Asp Leu Leu Ser Lys Arg Ser Lys Gin Thr Lys Ala Lys Lys Asp 
485 490 495 

Arg Leu Lys Lys Asp Leu Tyr Asp Phe Phe Asp Lys Leu Asp Asn Asp 
500 505 510 

Tyr Trp Asn He Glu He Asn Lys Leu He Asn Lys Asn Glu Glu Tyr 
515 520 525 
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Phe Ser Ser Thr Glu lie Thr Asp Thr Asn lie Asp Tyr Val Tyr Asn 
530 535 540 

Lys lie Lys Glu Gin Asn Asp Ala lie lie Lys Asn Leu Arg Asn Ser 
545 550 555 560 

Val Asp lie Lys Lys Pro Ser Gly Val Gly Leu Thr Lys Glu Leu Ser 
565 570 575 

Asn Leu Trp Asp Arg Tyr Gin lie Glu Arg Gin Lys lie Leu Leu Ser 
580 585 590 

Leu Asn Glu Leu Lys Asp Asn Val Asp Arg Lys Leu He Glu Leu Asp 
595 600 605 

Asn Lys Asn Asn Asp Phe Leu Asn Leu Arg Lys Arg Leu Glu Asp ser 
610 615 620 

Leu Asn Leu Gin Gin Ser Tyr Tyr Glu Lys Glu Leu Thr Lys Leu Tyr 
625 630 635 640 

Asn Asp Ala Lys Asn Ala Leu Lys Asp Val Gin Ser Lys Ala Asn Arg 
645 650 655 

Leu lie Ser Asp Asn Lys Lys Lys His Lys Ser Glu Leu Lys Asn He 
660 665 670 

Ser Tyr Glu Phe Gin Ser Thr Asn Leu Asn Gly Lys Asp Thr Ala Tyr 
675 680 685 

He Leu Asp Val Lys Arg Asn Leu Glu Ser Lys He Glu Asn Thr Ser 
690 695 700 

Asn Glu Val He Asn Glu He Arg Lys Leu Thr Asp Gin He Ala He 
705 710 715 720 

He Ser Asp Ser Thr Thr Ser Glu Asn Leu Ser Ser Ala Gin Val Thr 
725 730 735 

Glu Ala He Glu Thr Glu Leu Glu His Leu Arg Asp Gin Gin Ala Asn 
740 745 750 

Asn Ala Glu Leu He Leu Leu Gly Met Ala Leu Ser Val Val His His 
755 760 765 

Glu Phe Asn Gly Asn He Arg Ala He Arg Ser Ala Leu Arg Glu Leu 
770 775 780 
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Lys Ala Trp Ala Asp Arg Asn Pro Lys Leu Asp lie lie Tyr Gin Lys 
785 790 795 800 

lie Arg Thr Ser Phe Asp His Leu Asp Gly Tyr Leu Lys Thr Phe Thr 
805 810 815 

Pro Leu Thr Arg Arg Leu Ser Arg Ser Lys Thr Asn He Thr Gly Thr 
820 825 830 

Ala He Leu Glu Phe He Arg Asp Val Phe Asp Asp Arg Leu Glu Lys 
835 840 845 

Glu Gly He Glu Leu Phe Thr Thr Ser Lys Phe Val Asn Gin Glu He 
850 855 860 

Val Thr Tyr Thr Ser Thr He Tyr Pro Val Phe He Asn Leu He Asp 
865 870 875 880 

Asn Ala He Tyr Trp Leu Gly Lys Thr Thr Gly Glu Lys Arg Leu He 
885 890 895 

Leu Asp Ala Thr Glu Thr Gly Phe Val He Gly Asp Thr Gly Pro Gly 
900 905 910 

Val Ser Thr Arg Asp Arg Asp He He Phe Asp Met Gly Phe Thr Arg 
915 920 925 

Lys Thr Gly Gly Arg Gly Met Gly Leu Phe He Ser Lys Glu Cys Leu 
930 935 940 

Ser Arg Asp Gly Phe Thr He Arg Leu Asp Asp Tyr Thr Pro Glu Gin 
»45 950 955 960 

Gly Ala Phe Phe He He Glu Pro Ser Glu Glu Thr Ser Glu 
965 970 



<210> 45 
<211> 555 
<212> PRT 

<213> Escherichia coli 
<400> 45 

Met Thr Ser Ser Thr Asp Phe His 
1 5 

Arg Phe Leu His Ser Val Val Ala 
20 



Lys Leu Ser Glu Asp Cys Val Arg 
10 15 

Val Asp Asp Asn Met Ser Phe Gly 
25 30 
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Ala Gly Ser Asp Thr Phe Pro Thr Asp Glu Asp lie Asn Ala Leu Val 
35 40 45 

Asp Pro Asp Asp Asp Pro Thr Pro He He Thr Ala Ser Ala Ser Pro 



Arg He Glu Ser Thr Lys Ser Lys Ala Lys Val Lys Asn His Pro Phe 
65 70 75 80 

Asp Tyr Gin Ala Leu Ala Glu Ala Phe Ala Lys Asp Gly He Ala Cys 



Cys Gly Leu Leu Ala Lys Ser Phe Asn Val Glu Glu Arg Asp He He 
100 105 HO 

Thr Ala Ser Ser His Lys Ala Asp He Thr He Leu Asp Trp Asp Met 
115 120 125 

Gin Ser Asp Ser Gly Gin Phe Ala He Glu He He Lys Ser He He 
130 135 140 

Val Ser Asp He Asn Ser Gly Gly Arg Leu Arg Leu Leu Ser He Tyr 
145 150 155 160 

Thr Gly Glu His Val Thr Ala Val He Thr Lys Leu Asn Asn Glu Leu 
165 170 175 

Lys Lys Thr Tyr Arg Ser Val He Lys Asn Asp Asp Ser He Phe He 
180 185 190 

Glu Asp Asn Tyr Ala Leu Glu Gin Trp Cys He Val Val He Ser Lys 
195 200 205 

Asp Val Tyr Glu Lys Asp Leu Pro Asn Val Leu He Lys Lys Phe Thr 
210 215 220 

Asn Leu Thr Ala Gly Leu Leu Ser Asn Ala Ala Leu Ser Cys He Ser 
225 230 235 240 

Glu He Arg Glu Lys Thr His Gly He Leu Thr Lys Tyr Asn Asn Lys 
245 250 255 

Leu Asp Thr Ala Tyr Val Ser His He Leu Asn Leu He Lys Ser Lys 
260 265 270 

Glu Ser Arg Ala Tyr Ala Tyr Glu Asn Ala His Asp Tyr Ala Val Asp 
275 280 285 
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Leu lie Ser Glu Glu 
290 

Lys Lys Ser Leu Ser 
305 

Tyr Ala Lys Asn Gly 
325 

Lys Asp Leu Ser Val 
340 

Leu Glu Glu lie Gin 
355 

Glu Tyr Leu Ser Gin 
370 

Cys Ser Leu Glu He 
385 

Asp Asn Val Ser Leu 
405 

Phe Val Tyr Leu Cys 
420 

Glu Lys Ala Asp Phe 
435 

Tyr Asn Leu Leu He 
450 

Met Pro Ala Lys Ala 
465 

Asn Gly Asn Gly Val 
485 

Asp Tyr lie Ser Phe 
500 

Lys Val Leu Lys Trp 
515 

He Thr Thr Asp He 
530 



He Arg Ser He Leu Gin 
295 

Lys Asn Ser Leu Ser His 
310 315 

Cys Lys Asn Phe Leu Leu 
330 

Glu His Leu Arg Asn He 
345 

His Ala He Glu His Ala 
360 

Asp Gly Glu Glu Asp Lys 
375 

Thr Arg Arg Ser Leu Arg 
390 395 

Lys Gin Gly Thr Leu Leu 
410 

He Gin Pro Leu Cys Asp 
425 

Leu Phe Leu Arg Gly Thr 
440 

Glu Asp Glu Tyr Gly Gly 
455 

Ser Asn He He Ser Phe 
470 475 

He He Gly Lys Lys Asn 
490 

Val Pro Leu Leu Val Glu 
505 

He Gly Glu He Lys Thr 
520 

Val Ala Asn Leu Ser Arg 
535 



He Ser Glu Asn Leu 
300 

Trp Pro He Phe His 
320 

Thr Gly Lys Lys Gin 
335 

Leu Ser Ala Asp Ser 
350 

Ser Leu Gly Lys Lys 
365 

Lys Leu Met Gin Leu 
380 

Tyr His Ser His He 
400 

Leu Asp Ala Tyr Asn 
415 

Ser Val Arg Leu His 
430 

Leu Asp Asp Asn Asn 
445 

Phe Tyr Lys He Lys 
460 

Ser Phe Gly Val Glu 
480 

Asn Leu Val Asn Thr 

495 

Lys He Ser Thr Pro 
510 

Thr Tyr Ala Gin Lys 
525 

He Gly Leu Asp Gin 
540 
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His Glu Trp Leu Arg He Lys Ser Lys Asp He 
545 550 555 



<210> 46 
<211> 82 
<212> PRT 

<213> Escherichia coli 
<400> 46 

Met ser Ser Arg Gin He Leu Glu His Tyr Asn Ala Leu Thr Tyr Pro 
15 io 15 

Leu His Gin Ser He Leu Leu Gin He Met Thr Ser Asn Leu Leu Ser 



Val Cys Thr Gly Lys Ser He Tyr Glu Asp He Ser Gly Ser Ser Trp 

35 40 45 

Asn He He His Phe Asn He Pro Leu Pro He Ser Arg Ala Arg Leu 

50 55 60 

Ser He Phe Ser Tyr Cys Val Arg He Lys Pro Trp Met Ser Met Asp 

65 70 75 80 



<210> 47 
<211> 98 
<212> PRT 

<213> Escherichia coli 
<400> 47 

Met Ser He He Phe Asn Gly His Tyr Arg Met Lys His Arg Thr Trp 
15 io 15 

He Thr Glu Ala Leu Arg Leu His Phe Glu Glu His Leu Pro Gin Val 
20 25 30 

Val Val Gly Arg Arg Leu Gly Val Pro Lys Ser Thr Ala Cys Gly Met 



Phe Val Arg Phe Arg Lys Ala Gly Phe Ser Trp Pro Leu Pro Ala Gly 
50 55 60 

Met Ser Glu Arg Glu Leu Asp Gly Arg Leu Tyr Gly Ser Thr Ser Thr 
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Val Pro Val Val Leu Cys Ser Gly Ser Val lie Gin Asp Thr Ser Lys 
85 90 95 



<210> 48 
<211> 106 
<212> PRT 

<213> Escherichia coli 
<400> 48 

Met lie Lys Thr Arg Arg Thr Lys Arg Thr Phe Ser Pro Glu Phe Lys 



Leu Glu Ala Phe Glu Gin Val Val Val Lys Tyr Gin Arg Asp Val Arg 



Glu Val Ala Gin Ala Leu Glu Leu Asn Pro Asp His Leu Arg Lys Trp 



lie Arg Leu Tyr Lys Gin Glu Leu Gin Gly He Glu Pro Ala Gly Asn 



Ala He Thr Pro Glu Gin Arg Glu He Gin Gin Leu Lys Ala Gin He 



Lys Arg Val Glu Met Glu Lys Glu He Leu Lys Gin Ala Ala Val Leu 
85 90 95 



Met Ser Glu He Pro Gly Lys Leu Ser Arg 
100 105 



<210> 49 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 49 

tgctctagag ccattactca gaatggg 
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<210> 50 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 50 

cgcgagctcg acgactgaat gatccc 26 

<210> 51 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 51 

tcccccgggt actgcagcac tcaacc 26 

<210> 52 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 52 

gatcccggga ccactgaaat gcgtgc 26 

<210> 53 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence oligonucleotide 
<400> 53 

tcgtctagag atgatggtga tggagcg 27 
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<210> 54 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 54 

gaactgcagc caaatactga taccaccc 28 

<210> 55 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 55 

gaactgcagg ctaaaacaga agacgcg 27 

<210> 56 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Oligonucleotide 
<400> 56 

catgcatgca ctccatatga caaccgc 27 

<210> 57 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: oligonucleotide 
<400> 57 

tcgtctagaa tgaagctgcg catgagg 27 
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<210> 58 
<211> 27 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 58 

caactgcagt cgcaaattgc gaactgg 



<210> 59 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 59 

caactgcaga ccgcaacttt tcgacgc 



<210> 60 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 60 

catgcatgcc agtgagccat tgttccc 



<210> 61 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence Oligonucleotide 
<400> 61 

tgctctagat acgactctga caggagg 
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<210> 62 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 62 

tcagatatca actaccagca gtttgg 

<210> 63 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 63 

tcagatatcc ataaagagtg acgtggc 

<210> 64 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 64 

tgctctagaa aacgtggcaa cagagcg 

<210> 65 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence Oligonucleotide 
<400> 65 

tgctctagaa ggcgttgtcg atcctg 
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<210> 66 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 66 

gaactgcagg aaaaggccga gcagactg 

<210> 67 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 67 

gaactgcagt acagccatgt ttacggt 

<210> 68 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 68 

catgcatgcg gtgtacgaca gtttgcg 

<210> 69 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 69 

tgctctagac acatcatggg cacacc 
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<210> 70 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 70 

gaactgcaga accgtccaca tcaggcg 27 



<210> 71 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<22 0> 

<223> Description of Artificial Sequence: Oligonucleotide 
<400> 71 

gaactgcaga ccctgcttgc cattccg 27 



<210> 72 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Oligonucleotide 



<4O0> 72 

catgcatgca taagcgtcga acaggcg 



27 



